; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008291 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008291
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function, DUF547
Genome locationchr9:16849711..16853402
RNA-Seq ExpressionLag0008291
SyntenyLag0008291
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158699.1 uncharacterized protein LOC111025163 isoform X1 [Momordica charantia]1.2e-29989.7Show/hide
Query:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL
        DK+A++DDRL NSLE  N V+MDM++VKEVEN KN S K+EVHNSLKQEI QLEKRLQDQFKLRSALEKALGHG+FPC+KSDK+SMPKSA+ELI EIATL
Subjt:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL

Query:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV
        ELEVVHLEQYLLSLYR+AFDGQSSSVSPSA DEKSKLPSTPRG SME PLPDIASKDE SAAPSGCQSLEN RK+CS+IGRDE L++SNFHRS+SSLTTV
Subjt:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV

Query:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF
        NAA+ +K+STSVESLD+TL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDM+KCIS+I+CKLAEPSL+NHGLSSPTSSLSSVSAF
Subjt:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF

Query:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
        SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKL +V+SLLENFRLLISRLEEVDL KLKYEEKLAFWINIHNSLVMHT
Subjt:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT

Query:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV
        YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTI+SCILGCRMPRPRQWLRLLLPSRTKFKTGDER+AYII+RPEPLLHFALCSGSHSDPAVRVYT+KRV
Subjt:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV

Query:  FQELESAKDEYIRATFGVRKDQK-ILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRA-ALGNPRKNVEWIPPSYTFRYLISKELM
        FQELESAKDEYIRATFGV KD+K ILLPK IESFAKDSGLC+SGLMEMIL SLPESLRKSVKR+  LGNPRKNVEWIPPSYTFRYLISKEL+
Subjt:  FQELESAKDEYIRATFGVRKDQK-ILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRA-ALGNPRKNVEWIPPSYTFRYLISKELM

XP_022158700.1 uncharacterized protein LOC111025163 isoform X2 [Momordica charantia]1.2e-29989.7Show/hide
Query:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL
        DK+A++DDRL NSLE  N V+MDM++VKEVEN KN S K+EVHNSLKQEI QLEKRLQDQFKLRSALEKALGHG+FPC+KSDK+SMPKSA+ELI EIATL
Subjt:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL

Query:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV
        ELEVVHLEQYLLSLYR+AFDGQSSSVSPSA DEKSKLPSTPRG SME PLPDIASKDE SAAPSGCQSLEN RK+CS+IGRDE L++SNFHRS+SSLTTV
Subjt:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV

Query:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF
        NAA+ +K+STSVESLD+TL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDM+KCIS+I+CKLAEPSL+NHGLSSPTSSLSSVSAF
Subjt:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF

Query:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
        SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKL +V+SLLENFRLLISRLEEVDL KLKYEEKLAFWINIHNSLVMHT
Subjt:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT

Query:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV
        YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTI+SCILGCRMPRPRQWLRLLLPSRTKFKTGDER+AYII+RPEPLLHFALCSGSHSDPAVRVYT+KRV
Subjt:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV

Query:  FQELESAKDEYIRATFGVRKDQK-ILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRA-ALGNPRKNVEWIPPSYTFRYLISKELM
        FQELESAKDEYIRATFGV KD+K ILLPK IESFAKDSGLC+SGLMEMIL SLPESLRKSVKR+  LGNPRKNVEWIPPSYTFRYLISKEL+
Subjt:  FQELESAKDEYIRATFGVRKDQK-ILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRA-ALGNPRKNVEWIPPSYTFRYLISKELM

XP_038900882.1 uncharacterized protein LOC120087938 isoform X1 [Benincasa hispida]2.1e-30190.68Show/hide
Query:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL
        DK+ IRDDRLDN LE SNTVKMDM+ VKEVENKKNSSPK+EVHNSLKQEI QLEKRLQDQFKLRSALEK LGHGVF CN+SDK+SMPKSA+ELI+EIATL
Subjt:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL

Query:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV
        ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK KLP TPRGR+MEAP PDIASK  +SA PS CQSLEN RK+ SDIGRDE LL+SN+HRSQSSLTTV
Subjt:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV

Query:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF
        NAA+LDK+STSVESLD+TLR CHSQPVSMMEYAQNVS NIISLAEHLGTRISDH+PETPNRLSEDM+KCISTIY KLAEPSL NHGLSSPTSSLSSVSAF
Subjt:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF

Query:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
        SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLK+FSGPYSTMIEISWIC DPQKL  V+SLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
Subjt:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT

Query:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV
        YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTI+S ILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC GSHSDPAVRVYT KRV
Subjt:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV

Query:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM
        FQELE+AKDEYIRATFGV KDQKILLPK+IESFAKD+GLC+SGLMEMILKSLPESLRKSVKR+ LGNPRKNVEWIP SYTFRYLISKELM
Subjt:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM

XP_038900883.1 uncharacterized protein LOC120087938 isoform X2 [Benincasa hispida]2.1e-30190.68Show/hide
Query:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL
        DK+ IRDDRLDN LE SNTVKMDM+ VKEVENKKNSSPK+EVHNSLKQEI QLEKRLQDQFKLRSALEK LGHGVF CN+SDK+SMPKSA+ELI+EIATL
Subjt:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL

Query:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV
        ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK KLP TPRGR+MEAP PDIASK  +SA PS CQSLEN RK+ SDIGRDE LL+SN+HRSQSSLTTV
Subjt:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV

Query:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF
        NAA+LDK+STSVESLD+TLR CHSQPVSMMEYAQNVS NIISLAEHLGTRISDH+PETPNRLSEDM+KCISTIY KLAEPSL NHGLSSPTSSLSSVSAF
Subjt:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF

Query:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
        SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLK+FSGPYSTMIEISWIC DPQKL  V+SLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
Subjt:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT

Query:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV
        YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTI+S ILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC GSHSDPAVRVYT KRV
Subjt:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV

Query:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM
        FQELE+AKDEYIRATFGV KDQKILLPK+IESFAKD+GLC+SGLMEMILKSLPESLRKSVKR+ LGNPRKNVEWIP SYTFRYLISKELM
Subjt:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM

XP_038900884.1 uncharacterized protein LOC120087938 isoform X3 [Benincasa hispida]1.4e-29291.04Show/hide
Query:  MDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATLELEVVHLEQYLLSLYRKAFDG
        MDM+ VKEVENKKNSSPK+EVHNSLKQEI QLEKRLQDQFKLRSALEK LGHGVF CN+SDK+SMPKSA+ELI+EIATLELEVVHLEQYLLSLYRKAFDG
Subjt:  MDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATLELEVVHLEQYLLSLYRKAFDG

Query:  QSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTVNAAALDKVSTSVESLDKTLRA
        QSSSVSPSAKDEK KLP TPRGR+MEAP PDIASK  +SA PS CQSLEN RK+ SDIGRDE LL+SN+HRSQSSLTTVNAA+LDK+STSVESLD+TLR 
Subjt:  QSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTVNAAALDKVSTSVESLDKTLRA

Query:  CHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAFSPGEQCAMCSPGFRNNSSFDV
        CHSQPVSMMEYAQNVS NIISLAEHLGTRISDH+PETPNRLSEDM+KCISTIY KLAEPSL NHGLSSPTSSLSSVSAFSPGEQCAMCSPGFRNNSSFDV
Subjt:  CHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAFSPGEQCAMCSPGFRNNSSFDV

Query:  RLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSA
        RLDNPFLVEGLK+FSGPYSTMIEISWIC DPQKL  V+SLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSA
Subjt:  RLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSA

Query:  YNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRVFQELESAKDEYIRATFGVRKD
        YNIGGHTISVDTI+S ILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALC GSHSDPAVRVYT KRVFQELE+AKDEYIRATFGV KD
Subjt:  YNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRVFQELESAKDEYIRATFGVRKD

Query:  QKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM
        QKILLPK+IESFAKD+GLC+SGLMEMILKSLPESLRKSVKR+ LGNPRKNVEWIP SYTFRYLISKELM
Subjt:  QKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM

TrEMBL top hitse value%identityAlignment
A0A1S3C5X2 uncharacterized protein LOC103497400 isoform X11.5e-28987.12Show/hide
Query:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL
        DK+A++DDRL+N LE SNTVK+DM++VKEVENKKNSS K+ VH+SLKQEI QLEKRLQDQFKLRSALEK LGHGVFPCN+SDKISMPKSA+ELI+EIA L
Subjt:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL

Query:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV
        E+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK+KLPSTP GR MEAPLPDIA K  +SA PS C SL+N R+D SDIGRDE LL+ ++ RSQSSLTTV
Subjt:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV

Query:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF
        NA +L KVSTSVESLD+TLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDM+KCISTIY KLAEPS  N GLSSP SSLSSVSAF
Subjt:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF

Query:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
        SPGEQ AMCSPGFRNNSSFDV LDNPFLVEGLK+FSGPYSTMIEISWI GDPQKL  V+SLLENFRLLISRLEEVDLR L YEEKLAFWINIHNSLVMHT
Subjt:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT

Query:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV
        YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTI+SCILGCRMPRPRQWLRLLLPSRTKFK GDERQ YI+DRPEPLLHFALCSGSHSDPAVRVYT KRV
Subjt:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV

Query:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM
        FQELE++KDEYIRATFGVRKDQKILLPK+IESF KDSGLC+ GLMEMILKSLPESLRKSVKR+ LGNPRK VEWIPP+YTFRYLISKEL+
Subjt:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM

A0A5A7TUI2 Uncharacterized protein1.5e-28987.12Show/hide
Query:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL
        DK+A++DDRL+N LE SNTVK+DM++VKEVENKKNSS K+ VH+SLKQEI QLEKRLQDQFKLRSALEK LGHGVFPCN+SDKISMPKSA+ELI+EIA L
Subjt:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL

Query:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV
        E+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK+KLPSTP GR MEAPLPDIA K  +SA PS C SL+N R+D SDIGRDE LL+ ++ RSQSSLTTV
Subjt:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV

Query:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF
        NA +L KVSTSVESLD+TLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDM+KCISTIY KLAEPS  N GLSSP SSLSSVSAF
Subjt:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF

Query:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
        SPGEQ AMCSPGFRNNSSFDV LDNPFLVEGLK+FSGPYSTMIEISWI GDPQKL  V+SLLENFRLLISRLEEVDLR L YEEKLAFWINIHNSLVMHT
Subjt:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT

Query:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV
        YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTI+SCILGCRMPRPRQWLRLLLPSRTKFK GDERQ YI+DRPEPLLHFALCSGSHSDPAVRVYT KRV
Subjt:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV

Query:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM
        FQELE++KDEYIRATFGVRKDQKILLPK+IESF KDSGLC+ GLMEMILKSLPESLRKSVKR+ LGNPRK VEWIPP+YTFRYLISKEL+
Subjt:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM

A0A5D3BMC4 Uncharacterized protein1.5e-28987.12Show/hide
Query:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL
        DK+A++DDRL+N LE SNTVK+DM++VKEVENKKNSS K+ VH+SLKQEI QLEKRLQDQFKLRSALEK LGHGVFPCN+SDKISMPKSA+ELI+EIA L
Subjt:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL

Query:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV
        E+EVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEK+KLPSTP GR MEAPLPDIA K  +SA PS C SL+N R+D SDIGRDE LL+ ++ RSQSSLTTV
Subjt:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV

Query:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF
        NA +L KVSTSVESLD+TLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDH+PETPNRLSEDM+KCISTIY KLAEPS  N GLSSP SSLSSVSAF
Subjt:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF

Query:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
        SPGEQ AMCSPGFRNNSSFDV LDNPFLVEGLK+FSGPYSTMIEISWI GDPQKL  V+SLLENFRLLISRLEEVDLR L YEEKLAFWINIHNSLVMHT
Subjt:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT

Query:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV
        YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTI+SCILGCRMPRPRQWLRLLLPSRTKFK GDERQ YI+DRPEPLLHFALCSGSHSDPAVRVYT KRV
Subjt:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV

Query:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM
        FQELE++KDEYIRATFGVRKDQKILLPK+IESF KDSGLC+ GLMEMILKSLPESLRKSVKR+ LGNPRK VEWIPP+YTFRYLISKEL+
Subjt:  FQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKNVEWIPPSYTFRYLISKELM

A0A6J1DWJ5 uncharacterized protein LOC111025163 isoform X25.6e-30089.7Show/hide
Query:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL
        DK+A++DDRL NSLE  N V+MDM++VKEVEN KN S K+EVHNSLKQEI QLEKRLQDQFKLRSALEKALGHG+FPC+KSDK+SMPKSA+ELI EIATL
Subjt:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL

Query:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV
        ELEVVHLEQYLLSLYR+AFDGQSSSVSPSA DEKSKLPSTPRG SME PLPDIASKDE SAAPSGCQSLEN RK+CS+IGRDE L++SNFHRS+SSLTTV
Subjt:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV

Query:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF
        NAA+ +K+STSVESLD+TL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDM+KCIS+I+CKLAEPSL+NHGLSSPTSSLSSVSAF
Subjt:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF

Query:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
        SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKL +V+SLLENFRLLISRLEEVDL KLKYEEKLAFWINIHNSLVMHT
Subjt:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT

Query:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV
        YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTI+SCILGCRMPRPRQWLRLLLPSRTKFKTGDER+AYII+RPEPLLHFALCSGSHSDPAVRVYT+KRV
Subjt:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV

Query:  FQELESAKDEYIRATFGVRKDQK-ILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRA-ALGNPRKNVEWIPPSYTFRYLISKELM
        FQELESAKDEYIRATFGV KD+K ILLPK IESFAKDSGLC+SGLMEMIL SLPESLRKSVKR+  LGNPRKNVEWIPPSYTFRYLISKEL+
Subjt:  FQELESAKDEYIRATFGVRKDQK-ILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRA-ALGNPRKNVEWIPPSYTFRYLISKELM

A0A6J1E058 uncharacterized protein LOC111025163 isoform X15.6e-30089.7Show/hide
Query:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL
        DK+A++DDRL NSLE  N V+MDM++VKEVEN KN S K+EVHNSLKQEI QLEKRLQDQFKLRSALEKALGHG+FPC+KSDK+SMPKSA+ELI EIATL
Subjt:  DKQAIRDDRLDNSLELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATL

Query:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV
        ELEVVHLEQYLLSLYR+AFDGQSSSVSPSA DEKSKLPSTPRG SME PLPDIASKDE SAAPSGCQSLEN RK+CS+IGRDE L++SNFHRS+SSLTTV
Subjt:  ELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTV

Query:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF
        NAA+ +K+STSVESLD+TL ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDM+KCIS+I+CKLAEPSL+NHGLSSPTSSLSSVSAF
Subjt:  NAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAF

Query:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT
        SPGEQCAMCSPG RNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKL +V+SLLENFRLLISRLEEVDL KLKYEEKLAFWINIHNSLVMHT
Subjt:  SPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHT

Query:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV
        YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTI+SCILGCRMPRPRQWLRLLLPSRTKFKTGDER+AYII+RPEPLLHFALCSGSHSDPAVRVYT+KRV
Subjt:  YLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRV

Query:  FQELESAKDEYIRATFGVRKDQK-ILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRA-ALGNPRKNVEWIPPSYTFRYLISKELM
        FQELESAKDEYIRATFGV KD+K ILLPK IESFAKDSGLC+SGLMEMIL SLPESLRKSVKR+  LGNPRKNVEWIPPSYTFRYLISKEL+
Subjt:  FQELESAKDEYIRATFGVRKDQK-ILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRA-ALGNPRKNVEWIPPSYTFRYLISKELM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23700.1 Protein of unknown function, DUF5479.3e-14645.06Show/hide
Query:  GCENRTQPLQKH---------DKQAIRDDR-LDNSLELSNTVKMDMNQV--KEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVF
        G E++    Q+H         +K+ + D+  +D+SL+ S  +K+D+ +   K  E KK+ SP ++  +SLKQEI +LEKRLQ+QF +R ALEKALG+   
Subjt:  GCENRTQPLQKH---------DKQAIRDDR-LDNSLELSNTVKMDMNQV--KEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVF

Query:  PCNKSDKISMPKSAMELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVS-PSAKDEKSKLP-STPRGRSM-------------------------EAP
        P       S PK   ELI+EIA LELEV HLEQYLLSLYRKAFD Q+SSVS P++K + S  P ST RG+ +                         E  
Subjt:  PCNKSDKISMPKSAMELIREIATLELEVVHLEQYLLSLYRKAFDGQSSSVS-PSAKDEKSKLP-STPRGRSM-------------------------EAP

Query:  LPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLIS---------------------NFHRSQSSLTTVNAAALDKVSTSV----------------
         P++  + ES A    C S +N+ K+ S  GR  N  +S                     +F++  S + +   +  ++V   V                
Subjt:  LPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLIS---------------------NFHRSQSSLTTVNAAALDKVSTSV----------------

Query:  ----------------ESLDKTLR------------------------ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCIST
                        E +D  +R                        ACHSQP+S+ EY QN  SN  SLAEH+GTRISDH+  TPN+LSE+M+KC S 
Subjt:  ----------------ESLDKTLR------------------------ACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCIST

Query:  IYCKLAEPSLSNHGLSSPTSSLSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRL
        IY KLA+P   NHG SSP+SS SS S FSP +Q  M SP FR NSSFD +           +FSGPYS+MIE+S I  + +K RD++ +  NF LL+ +L
Subjt:  IYCKLAEPSLSNHGLSSPTSSLSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRL

Query:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID
        E VD RKL ++EKLAFWIN+HN+LVMHT+LA G+PQNN KR  LL K AY IGG  +S++ I+S IL  +MPRP QWL+LLL  + KF+TGDE Q Y ++
Subjt:  EEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIID

Query:  RPEPLLHFALCSGSHSDPAVRVYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKN-
          EPLL+FALCSG+HSDPA+RV+T K ++QELE+AK+EYIRATFGV+KDQK++LPK+IESF+KDSGL  + LMEMI + LPE+++K++K+   G  RK+ 
Subjt:  RPEPLLHFALCSGSHSDPAVRVYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAALGNPRKN-

Query:  VEWIPPSYTFRYLISKELM
        VEW P ++ FRYLI++EL+
Subjt:  VEWIPPSYTFRYLISKELM

AT5G66600.1 Protein of unknown function, DUF5472.1e-17457.79Show/hide
Query:  DKQAIRDDRLDNSL-ELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAMELIRE
        +K+ +  D+  NS  E S  +K+DM +  E    K++        SLKQEIT LE RLQDQFK+R ALEKALG+     +   +++ I+MPK A +LI++
Subjt:  DKQAIRDDRLDNSL-ELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAMELIRE

Query:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQ
        +A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ +++K K P  +TPR R   +   D  SK +    P      +NQ K       D + +  +F RS 
Subjt:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQ

Query:  SSLTTVNAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAE-PSLSNHGLSSPTSS
        S      +A   + ++  +S  K  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE MVKC+S IYCKLAE PS+ + GLSSP SS
Subjt:  SSLTTVNAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAE-PSLSNHGLSSPTSS

Query:  LSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIH
        LSS SAFSP +Q    SPGF N+SSFDVRLDN F VEG KDFSGPYS+++E+  I  D +K  +VE LL+NF+ LISRLEEVD RKLK+EEKLAFWIN+H
Subjt:  LSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIH

Query:  NSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVR
        N+LVMH +LAYG+PQNNVKR  LLLK+AYNIGGHTIS + I+S ILGC+M  P QWLRLL  SR KFK GDER AY ID PEPLLHFAL SGSHSDPAVR
Subjt:  NSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVR

Query:  VYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKR--AALGNPRKNVEWIPPSYTFRYLISKE
        VYT KR+ QELE++K+EYIR    +RK Q+ILLPK++E+FAKDSGLC +GL EM+ +S+PES RK VKR  ++   PRK ++WIP S+TFRYLI +E
Subjt:  VYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKR--AALGNPRKNVEWIPPSYTFRYLISKE

AT5G66600.2 Protein of unknown function, DUF5472.1e-17457.79Show/hide
Query:  DKQAIRDDRLDNSL-ELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAMELIRE
        +K+ +  D+  NS  E S  +K+DM +  E    K++        SLKQEIT LE RLQDQFK+R ALEKALG+     +   +++ I+MPK A +LI++
Subjt:  DKQAIRDDRLDNSL-ELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAMELIRE

Query:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQ
        +A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ +++K K P  +TPR R   +   D  SK +    P      +NQ K       D + +  +F RS 
Subjt:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQ

Query:  SSLTTVNAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAE-PSLSNHGLSSPTSS
        S      +A   + ++  +S  K  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE MVKC+S IYCKLAE PS+ + GLSSP SS
Subjt:  SSLTTVNAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAE-PSLSNHGLSSPTSS

Query:  LSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIH
        LSS SAFSP +Q    SPGF N+SSFDVRLDN F VEG KDFSGPYS+++E+  I  D +K  +VE LL+NF+ LISRLEEVD RKLK+EEKLAFWIN+H
Subjt:  LSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIH

Query:  NSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVR
        N+LVMH +LAYG+PQNNVKR  LLLK+AYNIGGHTIS + I+S ILGC+M  P QWLRLL  SR KFK GDER AY ID PEPLLHFAL SGSHSDPAVR
Subjt:  NSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVR

Query:  VYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKR--AALGNPRKNVEWIPPSYTFRYLISKE
        VYT KR+ QELE++K+EYIR    +RK Q+ILLPK++E+FAKDSGLC +GL EM+ +S+PES RK VKR  ++   PRK ++WIP S+TFRYLI +E
Subjt:  VYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKR--AALGNPRKNVEWIPPSYTFRYLISKE

AT5G66600.3 Protein of unknown function, DUF5472.1e-17457.79Show/hide
Query:  DKQAIRDDRLDNSL-ELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAMELIRE
        +K+ +  D+  NS  E S  +K+DM +  E    K++        SLKQEIT LE RLQDQFK+R ALEKALG+     +   +++ I+MPK A +LI++
Subjt:  DKQAIRDDRLDNSL-ELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAMELIRE

Query:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQ
        +A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ +++K K P  +TPR R   +   D  SK +    P      +NQ K       D + +  +F RS 
Subjt:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQ

Query:  SSLTTVNAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAE-PSLSNHGLSSPTSS
        S      +A   + ++  +S  K  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE MVKC+S IYCKLAE PS+ + GLSSP SS
Subjt:  SSLTTVNAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAE-PSLSNHGLSSPTSS

Query:  LSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIH
        LSS SAFSP +Q    SPGF N+SSFDVRLDN F VEG KDFSGPYS+++E+  I  D +K  +VE LL+NF+ LISRLEEVD RKLK+EEKLAFWIN+H
Subjt:  LSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIH

Query:  NSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVR
        N+LVMH +LAYG+PQNNVKR  LLLK+AYNIGGHTIS + I+S ILGC+M  P QWLRLL  SR KFK GDER AY ID PEPLLHFAL SGSHSDPAVR
Subjt:  NSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVR

Query:  VYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKR--AALGNPRKNVEWIPPSYTFRYLISKE
        VYT KR+ QELE++K+EYIR    +RK Q+ILLPK++E+FAKDSGLC +GL EM+ +S+PES RK VKR  ++   PRK ++WIP S+TFRYLI +E
Subjt:  VYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKR--AALGNPRKNVEWIPPSYTFRYLISKE

AT5G66600.4 Protein of unknown function, DUF5472.1e-17457.79Show/hide
Query:  DKQAIRDDRLDNSL-ELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAMELIRE
        +K+ +  D+  NS  E S  +K+DM +  E    K++        SLKQEIT LE RLQDQFK+R ALEKALG+     +   +++ I+MPK A +LI++
Subjt:  DKQAIRDDRLDNSL-ELSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGH---GVFPCNKSDKISMPKSAMELIRE

Query:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQ
        +A LE+EV+HLEQYLLSLYRKAF+ Q SSVSP+ +++K K P  +TPR R   +   D  SK +    P      +NQ K       D + +  +F RS 
Subjt:  IATLELEVVHLEQYLLSLYRKAFDGQSSSVSPSAKDEKSKLP--STPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQ

Query:  SSLTTVNAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAE-PSLSNHGLSSPTSS
        S      +A   + ++  +S  K  R+CHSQP+    Y QN   N+ISLAEHLGTRISDHVPETPN+LSE MVKC+S IYCKLAE PS+ + GLSSP SS
Subjt:  SSLTTVNAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQNVSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAE-PSLSNHGLSSPTSS

Query:  LSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIH
        LSS SAFSP +Q    SPGF N+SSFDVRLDN F VEG KDFSGPYS+++E+  I  D +K  +VE LL+NF+ LISRLEEVD RKLK+EEKLAFWIN+H
Subjt:  LSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEISWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIH

Query:  NSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVR
        N+LVMH +LAYG+PQNNVKR  LLLK+AYNIGGHTIS + I+S ILGC+M  P QWLRLL  SR KFK GDER AY ID PEPLLHFAL SGSHSDPAVR
Subjt:  NSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLPSRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVR

Query:  VYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKR--AALGNPRKNVEWIPPSYTFRYLISKE
        VYT KR+ QELE++K+EYIR    +RK Q+ILLPK++E+FAKDSGLC +GL EM+ +S+PES RK VKR  ++   PRK ++WIP S+TFRYLI +E
Subjt:  VYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKR--AALGNPRKNVEWIPPSYTFRYLISKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTTTTTTCTTTGTGGGTCTGCAGCAGCCTTTCGTAGCCTCACTACTCAGTCTCATGAGCCACGGATTGATCACTTTCTGGTTTCTTCCATTTTGGCTCCCTTGAT
CTCTCTTCCTGTTTTGTTACTTCTATTCTTGTTTGGGAGTTGGGGTTTTTTGTTTTGGGAGTTTGTTTACAATTTTGAGTTGGTTTTAAGGATTTGGAATCTGATTGGAA
GTGGGGTTAGAAGAAGAAATGCTGGGAGTGGATGTGAAAACAGGACACAGCCGCTCCAAAAGCACGACAAGCAAGCGATTCGAGATGATCGATTGGATAATTCACTCGAA
CTTTCCAACACTGTAAAGATGGATATGAATCAGGTGAAAGAAGTAGAGAATAAGAAAAACTCGTCGCCGAAGATGGAAGTGCATAATTCTTTGAAGCAAGAGATTACTCA
ACTTGAGAAGCGATTGCAAGACCAGTTCAAGCTCCGCTCTGCATTAGAAAAAGCATTGGGTCATGGGGTCTTTCCCTGTAACAAATCAGACAAAATCTCAATGCCAAAGT
CAGCTATGGAGTTAATCAGGGAGATTGCAACGTTAGAACTAGAAGTTGTGCATTTAGAACAATACCTACTTTCTTTGTATCGAAAAGCATTTGATGGACAATCATCCTCT
GTATCTCCTTCTGCTAAGGACGAAAAGTCGAAATTGCCTTCAACCCCTAGAGGCAGGTCAATGGAAGCTCCCCTACCTGATATTGCCTCAAAAGATGAAAGCTCAGCAGC
CCCTTCTGGTTGCCAGTCACTTGAAAATCAAAGGAAAGATTGCAGTGATATTGGGAGAGATGAAAATCTATTGATTTCCAATTTTCATCGTTCTCAATCGTCACTGACAA
CGGTAAATGCAGCTGCATTGGATAAAGTGTCTACTTCAGTAGAGTCATTGGATAAAACTCTGCGTGCATGTCATTCCCAACCGGTGTCCATGATGGAGTATGCACAAAAT
GTTTCATCCAATATTATCAGTCTCGCTGAGCATCTTGGAACTCGCATCTCTGACCACGTACCGGAGACTCCTAACCGACTTTCAGAAGACATGGTCAAGTGCATATCAAC
CATATATTGCAAACTTGCCGAACCATCTTTGTCGAATCATGGACTTTCATCTCCAACGTCTTCTTTGTCATCAGTCAGTGCATTTTCTCCAGGAGAACAGTGTGCCATGT
GCAGTCCAGGGTTCAGAAACAATTCATCCTTCGACGTTCGTTTGGACAATCCTTTTCTTGTGGAAGGTCTTAAAGATTTTAGTGGGCCATACAGTACAATGATTGAGATT
TCATGGATCTGTGGAGACCCTCAGAAACTACGTGATGTCGAGAGTTTGCTTGAAAACTTCAGGTTGCTTATCTCTCGTTTAGAGGAAGTCGATTTGAGGAAACTTAAGTA
CGAGGAAAAGCTGGCTTTCTGGATAAATATTCACAACTCGCTTGTGATGCATACATACTTAGCTTATGGGGTACCCCAAAACAATGTCAAGAGGGCCTTTTTACTCTTGA
AGTCTGCATATAACATAGGAGGCCATACAATTAGTGTAGACACAATAGAAAGTTGTATTCTTGGCTGTCGGATGCCTCGCCCTCGGCAGTGGCTTCGGCTCTTGCTTCCT
TCAAGGACGAAATTCAAGACCGGAGATGAGCGACAAGCATATATAATTGACCGTCCAGAACCCCTTTTACATTTCGCACTTTGCTCCGGGAGCCACTCCGATCCTGCGGT
TCGTGTTTATACATCTAAAAGAGTGTTTCAAGAACTGGAATCTGCAAAAGACGAGTATATTCGAGCAACCTTCGGTGTACGAAAAGATCAAAAGATCCTCCTTCCAAAGA
TGATCGAGTCATTCGCAAAGGACTCTGGTCTATGTACATCTGGCCTGATGGAGATGATTCTCAAGAGCTTGCCTGAATCTCTGAGAAAGAGTGTCAAACGAGCTGCGCTC
GGGAATCCTCGGAAGAATGTCGAATGGATTCCTCCAAGTTACACTTTCAGGTATCTGATCTCCAAGGAGCTGATGGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCTTTTTTCTTTGTGGGTCTGCAGCAGCCTTTCGTAGCCTCACTACTCAGTCTCATGAGCCACGGATTGATCACTTTCTGGTTTCTTCCATTTTGGCTCCCTTGAT
CTCTCTTCCTGTTTTGTTACTTCTATTCTTGTTTGGGAGTTGGGGTTTTTTGTTTTGGGAGTTTGTTTACAATTTTGAGTTGGTTTTAAGGATTTGGAATCTGATTGGAA
GTGGGGTTAGAAGAAGAAATGCTGGGAGTGGATGTGAAAACAGGACACAGCCGCTCCAAAAGCACGACAAGCAAGCGATTCGAGATGATCGATTGGATAATTCACTCGAA
CTTTCCAACACTGTAAAGATGGATATGAATCAGGTGAAAGAAGTAGAGAATAAGAAAAACTCGTCGCCGAAGATGGAAGTGCATAATTCTTTGAAGCAAGAGATTACTCA
ACTTGAGAAGCGATTGCAAGACCAGTTCAAGCTCCGCTCTGCATTAGAAAAAGCATTGGGTCATGGGGTCTTTCCCTGTAACAAATCAGACAAAATCTCAATGCCAAAGT
CAGCTATGGAGTTAATCAGGGAGATTGCAACGTTAGAACTAGAAGTTGTGCATTTAGAACAATACCTACTTTCTTTGTATCGAAAAGCATTTGATGGACAATCATCCTCT
GTATCTCCTTCTGCTAAGGACGAAAAGTCGAAATTGCCTTCAACCCCTAGAGGCAGGTCAATGGAAGCTCCCCTACCTGATATTGCCTCAAAAGATGAAAGCTCAGCAGC
CCCTTCTGGTTGCCAGTCACTTGAAAATCAAAGGAAAGATTGCAGTGATATTGGGAGAGATGAAAATCTATTGATTTCCAATTTTCATCGTTCTCAATCGTCACTGACAA
CGGTAAATGCAGCTGCATTGGATAAAGTGTCTACTTCAGTAGAGTCATTGGATAAAACTCTGCGTGCATGTCATTCCCAACCGGTGTCCATGATGGAGTATGCACAAAAT
GTTTCATCCAATATTATCAGTCTCGCTGAGCATCTTGGAACTCGCATCTCTGACCACGTACCGGAGACTCCTAACCGACTTTCAGAAGACATGGTCAAGTGCATATCAAC
CATATATTGCAAACTTGCCGAACCATCTTTGTCGAATCATGGACTTTCATCTCCAACGTCTTCTTTGTCATCAGTCAGTGCATTTTCTCCAGGAGAACAGTGTGCCATGT
GCAGTCCAGGGTTCAGAAACAATTCATCCTTCGACGTTCGTTTGGACAATCCTTTTCTTGTGGAAGGTCTTAAAGATTTTAGTGGGCCATACAGTACAATGATTGAGATT
TCATGGATCTGTGGAGACCCTCAGAAACTACGTGATGTCGAGAGTTTGCTTGAAAACTTCAGGTTGCTTATCTCTCGTTTAGAGGAAGTCGATTTGAGGAAACTTAAGTA
CGAGGAAAAGCTGGCTTTCTGGATAAATATTCACAACTCGCTTGTGATGCATACATACTTAGCTTATGGGGTACCCCAAAACAATGTCAAGAGGGCCTTTTTACTCTTGA
AGTCTGCATATAACATAGGAGGCCATACAATTAGTGTAGACACAATAGAAAGTTGTATTCTTGGCTGTCGGATGCCTCGCCCTCGGCAGTGGCTTCGGCTCTTGCTTCCT
TCAAGGACGAAATTCAAGACCGGAGATGAGCGACAAGCATATATAATTGACCGTCCAGAACCCCTTTTACATTTCGCACTTTGCTCCGGGAGCCACTCCGATCCTGCGGT
TCGTGTTTATACATCTAAAAGAGTGTTTCAAGAACTGGAATCTGCAAAAGACGAGTATATTCGAGCAACCTTCGGTGTACGAAAAGATCAAAAGATCCTCCTTCCAAAGA
TGATCGAGTCATTCGCAAAGGACTCTGGTCTATGTACATCTGGCCTGATGGAGATGATTCTCAAGAGCTTGCCTGAATCTCTGAGAAAGAGTGTCAAACGAGCTGCGCTC
GGGAATCCTCGGAAGAATGTCGAATGGATTCCTCCAAGTTACACTTTCAGGTATCTGATCTCCAAGGAGCTGATGGCCTAA
Protein sequenceShow/hide protein sequence
MTFFLCGSAAAFRSLTTQSHEPRIDHFLVSSILAPLISLPVLLLLFLFGSWGFLFWEFVYNFELVLRIWNLIGSGVRRRNAGSGCENRTQPLQKHDKQAIRDDRLDNSLE
LSNTVKMDMNQVKEVENKKNSSPKMEVHNSLKQEITQLEKRLQDQFKLRSALEKALGHGVFPCNKSDKISMPKSAMELIREIATLELEVVHLEQYLLSLYRKAFDGQSSS
VSPSAKDEKSKLPSTPRGRSMEAPLPDIASKDESSAAPSGCQSLENQRKDCSDIGRDENLLISNFHRSQSSLTTVNAAALDKVSTSVESLDKTLRACHSQPVSMMEYAQN
VSSNIISLAEHLGTRISDHVPETPNRLSEDMVKCISTIYCKLAEPSLSNHGLSSPTSSLSSVSAFSPGEQCAMCSPGFRNNSSFDVRLDNPFLVEGLKDFSGPYSTMIEI
SWICGDPQKLRDVESLLENFRLLISRLEEVDLRKLKYEEKLAFWINIHNSLVMHTYLAYGVPQNNVKRAFLLLKSAYNIGGHTISVDTIESCILGCRMPRPRQWLRLLLP
SRTKFKTGDERQAYIIDRPEPLLHFALCSGSHSDPAVRVYTSKRVFQELESAKDEYIRATFGVRKDQKILLPKMIESFAKDSGLCTSGLMEMILKSLPESLRKSVKRAAL
GNPRKNVEWIPPSYTFRYLISKELMA