; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026329 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026329
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function, DUF547
Genome locationtig00153031:4133212..4137482
RNA-Seq ExpressionSgr026329
SyntenySgr026329
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK12940.1 uncharacterized protein E5676_scaffold255G005350 [Cucumis melo var. makuwa]1.9e-27878.71Show/hide
Query:  SSSASSDLQSLYCFSLSTFSIIGFPD--KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALG
        S S   +L+S YCFSLSTFS+IGF D  KDT EEE+LD SFEASNR+KL                               LEKRLQ+QFKVRT LE ALG
Subjt:  SSSASSDLQSLYCFSLSTFSIIGFPD--KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALG

Query:  YRSSSQDNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGD
        YRSSSQDNT D+EVPKPATELI+EIAVLELEV+HLEQYLLSLYRKAFDG I+STSPST DEKLKSP+NSP AKYIVN   DIA KKEDKAVQSGYD+FG+
Subjt:  YRSSSQDNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGD

Query:  PVKEYSGIGEDNLLDSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAI
        P++EYSGI ED LLDSSVRRC SSLSHYSV SKR S PEDSLG+ VRPCLSQP+SMMEFAQNASSNL SLAEYLGTQI DH+P+SANRLSEDM+KCISAI
Subjt:  PVKEYSGIGEDNLLDSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAI

Query:  YCKLSDPPLSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFI
        YCKLSDPP +HHGLSSPVSSSSPISAFSPQDQFDMLSPGF+NNP FDI LDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+               
Subjt:  YCKLSDPPLSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFI

Query:  CHVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSK
             + F +LISKLEEV+PRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVF+LLKAAYNIGG TISVDTIQSSILGCR+PRPGQWLSLL+PSK
Subjt:  CHVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSK

Query:  SKFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLR
        SK K+GDKR AYKIH+ EPLLHFALCTGCHSDPAVRVYTPK VLQELE AKEEY+RATFGIRKD+KV+LPKI+ESFAKE RLCT G MEMIQ+SLPESLR
Subjt:  SKFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLR

Query:  KSITKCQNGKSRKNVEWIPHNFTFRYLISREMVK
        +S++KCQNGKSRKN+EWI HNFTFRYLISREMVK
Subjt:  KSITKCQNGKSRKNVEWIPHNFTFRYLISREMVK

XP_022133011.1 uncharacterized protein LOC111005708 isoform X1 [Momordica charantia]5.6e-27581.71Show/hide
Query:  PDKDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIA
        PDKDT  EE LD SFEASNRIKL                               LEKRLQDQFKVRTALEKALGYRSSSQDN ND+EVPKPATELIREIA
Subjt:  PDKDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIA

Query:  VLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLS
        VLE+EV+HLEQYLLSLYRKAFDG I+STSPSTN+EKLKSPMNSPTAKYIVN    I SKKEDKAVQSGYDAFG PVKEYSGIGEDNLLDSSVRRCHSSLS
Subjt:  VLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLS

Query:  HYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISA
        HYSVFSKR S PEDSLG+ VRPCLSQP+SMMEFA NASSNL SLAEYLGTQI+DHIP+SANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISA
Subjt:  HYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISA

Query:  FSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHE
        FSPQDQFDML    +N+P+FDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+                    + F +LISKLEEVD RKLNHE
Subjt:  FSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHE

Query:  EKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALC
        EKLAFWINVHNSLMMHAYLAYGIPQNN+KK FLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSL+LPSKSKFKNGD+RQAYKIHRPEPLLHFALC
Subjt:  EKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALC

Query:  TGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRY
        TGCHSDPAVRVYTPKRV+ ELE AK EY+RATFGIRKD+KV+LPKII+SFAKE RLCTTG MEMIQ+SLPESLRK++TK QNGKSRKNVEW+PHNF+FRY
Subjt:  TGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRY

Query:  LISREMV
        LISRE+V
Subjt:  LISREMV

XP_031744119.1 uncharacterized protein LOC101222802 isoform X1 [Cucumis sativus]2.9e-27978.83Show/hide
Query:  SSASSDLQSLYCFSLSTFSIIGFPD--KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGY
        +S   DL+SL+CFS STFS+IGF D  KDT EEE LD SFEASNR+KL                               LEKRLQDQFKVRT LE ALGY
Subjt:  SSASSDLQSLYCFSLSTFSIIGFPD--KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGY

Query:  RSSSQDNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDP
        RSSSQDNT D+EVPKPATELI+EIAVLELEV+HLEQYLLSLYRKAFDG I+STSPST DEKLKSP+NSP AKY VN   D+  KKEDKAVQSGYD+FG+P
Subjt:  RSSSQDNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDP

Query:  VKEYSGIGEDNLLDSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIY
        ++EYSGI ED LLDSSVRRC SSLSHYSV SKR S PEDSLG+ VRPCLSQP+SMMEFAQNASSNL SLAEYLGTQI DH+P+SANRLSEDM+KCISAIY
Subjt:  VKEYSGIGEDNLLDSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIY

Query:  CKLSDPPLSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFIC
        CKLSDPP +HHGLSSPVSSSSPISAFSPQDQFDMLSPGF+NNPSFDI LDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+                
Subjt:  CKLSDPPLSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFIC

Query:  HVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKS
            + F +LISKLEEVDPRKLNHEEKLAFW NVHNSLMMHAYLAYGIPQNNMKKVF+LLKAAYNIGG TISVDTIQSSILGCR+PRPGQWLSLL+PSKS
Subjt:  HVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKS

Query:  KFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRK
        K KNGDKR AYKIH+ EPLLHFALCTGCHSDPAVRVYTPK VLQELE AKEEY+RATFGIRKD+KV+LPKI+ESFAKE RLCT GMMEMIQ+SLPESLR+
Subjt:  KFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRK

Query:  SITKCQNGKSRKNVEWIPHNFTFRYLISREMVK
        S+ KCQNGKSRKN+EWI HNFTFRYLISREMVK
Subjt:  SITKCQNGKSRKNVEWIPHNFTFRYLISREMVK

XP_038881198.1 uncharacterized protein LOC120072780 isoform X1 [Benincasa hispida]1.5e-27581.35Show/hide
Query:  KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVL
        KDT EEEQLD SFEASNR+KL                               LEKRLQDQFKVRT LE ALGYR S QDNTND+EVPKPATELI+EIAVL
Subjt:  KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVL

Query:  ELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHY
        ELEV+HLEQYLLSLYRKAFDG I+STSPSTNDEKLKSPMNSP AKYIVN   DI SKKEDKAVQSGYD+FG+PV+EYSGI  D LLDSSVRRC SSLSHY
Subjt:  ELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHY

Query:  SVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFS
        SV SKR S PEDSLG+ VRPCLSQP+SMMEFAQNASSNL SLAEYLGTQI DH+P+SANRLSEDM+KCISAIYCKLSDPP SHH LSSPVSSSSPISAFS
Subjt:  SVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFS

Query:  PQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEK
        PQDQFDMLSPGFRNNPSFD+ LDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+                    + F +LISKLEEVDPRKLNHEEK
Subjt:  PQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEK

Query:  LAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTG
        LAFWINVHNSLMMHAYLAYGIPQNNMKKVF+LLKAAYNIGG TISVDTIQSSILGCR+PRPGQWLSLL+PSKSK KNGDKRQAYKIHR EPLLHFALCTG
Subjt:  LAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTG

Query:  CHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRYLI
        CHSDPAVRVYTPKRVLQELE AK+EY+RATFGIRKD+KV+LPKIIESFAKE RLCTTG+MEMIQ+SLPESLRKS+ KCQNGKSRKN+EWI HNFTFRYLI
Subjt:  CHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRYLI

Query:  SREMVK
        SREMVK
Subjt:  SREMVK

XP_038881200.1 uncharacterized protein LOC120072780 isoform X2 [Benincasa hispida]1.5e-27581.35Show/hide
Query:  KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVL
        KDT EEEQLD SFEASNR+KL                               LEKRLQDQFKVRT LE ALGYR S QDNTND+EVPKPATELI+EIAVL
Subjt:  KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVL

Query:  ELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHY
        ELEV+HLEQYLLSLYRKAFDG I+STSPSTNDEKLKSPMNSP AKYIVN   DI SKKEDKAVQSGYD+FG+PV+EYSGI  D LLDSSVRRC SSLSHY
Subjt:  ELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHY

Query:  SVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFS
        SV SKR S PEDSLG+ VRPCLSQP+SMMEFAQNASSNL SLAEYLGTQI DH+P+SANRLSEDM+KCISAIYCKLSDPP SHH LSSPVSSSSPISAFS
Subjt:  SVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFS

Query:  PQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEK
        PQDQFDMLSPGFRNNPSFD+ LDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+                    + F +LISKLEEVDPRKLNHEEK
Subjt:  PQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEK

Query:  LAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTG
        LAFWINVHNSLMMHAYLAYGIPQNNMKKVF+LLKAAYNIGG TISVDTIQSSILGCR+PRPGQWLSLL+PSKSK KNGDKRQAYKIHR EPLLHFALCTG
Subjt:  LAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTG

Query:  CHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRYLI
        CHSDPAVRVYTPKRVLQELE AK+EY+RATFGIRKD+KV+LPKIIESFAKE RLCTTG+MEMIQ+SLPESLRKS+ KCQNGKSRKN+EWI HNFTFRYLI
Subjt:  CHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRYLI

Query:  SREMVK
        SREMVK
Subjt:  SREMVK

TrEMBL top hitse value%identityAlignment
A0A0A0KL33 Uncharacterized protein3.7e-27279.7Show/hide
Query:  KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVL
        KDT EEE LD SFEASNR+KL                               LEKRLQDQFKVRT LE ALGYRSSSQDNT D+EVPKPATELI+EIAVL
Subjt:  KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVL

Query:  ELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHY
        ELEV+HLEQYLLSLYRKAFDG I+STSPST DEKLKSP+NSP AKY VN   D+  KKEDKAVQSGYD+FG+P++EYSGI ED LLDSSVRRC SSLSHY
Subjt:  ELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHY

Query:  SVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFS
        SV SKR S PEDSLG+ VRPCLSQP+SMMEFAQNASSNL SLAEYLGTQI DH+P+SANRLSEDM+KCISAIYCKLSDPP +HHGLSSPVSSSSPISAFS
Subjt:  SVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFS

Query:  PQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEK
        PQDQFDMLSPGF+NNPSFDI LDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+                    + F +LISKLEEVDPRKLNHEEK
Subjt:  PQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEK

Query:  LAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTG
        LAFW NVHNSLMMHAYLAYGIPQNNMKKVF+LLKAAYNIGG TISVDTIQSSILGCR+PRPGQWLSLL+PSKSK KNGDKR AYKIH+ EPLLHFALCTG
Subjt:  LAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTG

Query:  CHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRYLI
        CHSDPAVRVYTPK VLQELE AKEEY+RATFGIRKD+KV+LPKI+ESFAKE RLCT GMMEMIQ+SLPESLR+S+ KCQNGKSRKN+EWI HNFTFRYLI
Subjt:  CHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRYLI

Query:  SREMVK
        SREMVK
Subjt:  SREMVK

A0A1S3B0I9 uncharacterized protein LOC103484720 isoform X21.8e-27179.54Show/hide
Query:  KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVL
        KDT EEE+LD SFEASNR+KL                               LEKRLQ+QFKVRT LE ALGYRSSSQDNT D+EVPKPATELI+EIAVL
Subjt:  KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVL

Query:  ELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHY
        ELEV+HLEQYLLSLYRKAFDG I+STSPST DEKLKSP+NSP AKYIVN   DIA KKEDKAVQSGYD+FG+P++EYSGI ED LLDSSVRRC SSLSHY
Subjt:  ELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHY

Query:  SVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFS
        SV SKR S PEDSLG+ VRPCLSQP+SMMEFAQNASSNL SLAEYLGTQI DH+P+SANRLSEDM+KCISAIYCKLSDPP +HHGLSSPVSSSSPISAFS
Subjt:  SVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFS

Query:  PQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEK
        PQDQFDMLSPGF+NNP FDI LDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+                    + F +LISKLEEV+PRKLNHEEK
Subjt:  PQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEK

Query:  LAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTG
        LAFWINVHNSLMMHAYLAYGIPQNNMKKVF+LLKAAYNIGG TISVDTIQSSILGCR+PRPGQWLSLL+PSKSK K+GDKR AYKIH+ EPLLHFALCTG
Subjt:  LAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTG

Query:  CHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRYLI
        CHSDPAVRVYTPK VLQELE AKEEY+RATFGIRKD+KV+LPKI+ESFAKE RLCT G MEMIQ+SLPESLR+S++KCQNGKSRKN+EWI HNFTFRYLI
Subjt:  CHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRYLI

Query:  SREMVK
        SREMVK
Subjt:  SREMVK

A0A5D3CNB6 Uncharacterized protein9.0e-27978.71Show/hide
Query:  SSSASSDLQSLYCFSLSTFSIIGFPD--KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALG
        S S   +L+S YCFSLSTFS+IGF D  KDT EEE+LD SFEASNR+KL                               LEKRLQ+QFKVRT LE ALG
Subjt:  SSSASSDLQSLYCFSLSTFSIIGFPD--KDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALG

Query:  YRSSSQDNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGD
        YRSSSQDNT D+EVPKPATELI+EIAVLELEV+HLEQYLLSLYRKAFDG I+STSPST DEKLKSP+NSP AKYIVN   DIA KKEDKAVQSGYD+FG+
Subjt:  YRSSSQDNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGD

Query:  PVKEYSGIGEDNLLDSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAI
        P++EYSGI ED LLDSSVRRC SSLSHYSV SKR S PEDSLG+ VRPCLSQP+SMMEFAQNASSNL SLAEYLGTQI DH+P+SANRLSEDM+KCISAI
Subjt:  PVKEYSGIGEDNLLDSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAI

Query:  YCKLSDPPLSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFI
        YCKLSDPP +HHGLSSPVSSSSPISAFSPQDQFDMLSPGF+NNP FDI LDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+               
Subjt:  YCKLSDPPLSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFI

Query:  CHVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSK
             + F +LISKLEEV+PRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVF+LLKAAYNIGG TISVDTIQSSILGCR+PRPGQWLSLL+PSK
Subjt:  CHVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSK

Query:  SKFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLR
        SK K+GDKR AYKIH+ EPLLHFALCTGCHSDPAVRVYTPK VLQELE AKEEY+RATFGIRKD+KV+LPKI+ESFAKE RLCT G MEMIQ+SLPESLR
Subjt:  SKFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLR

Query:  KSITKCQNGKSRKNVEWIPHNFTFRYLISREMVK
        +S++KCQNGKSRKN+EWI HNFTFRYLISREMVK
Subjt:  KSITKCQNGKSRKNVEWIPHNFTFRYLISREMVK

A0A6J1BTU6 uncharacterized protein LOC111005708 isoform X12.7e-27581.71Show/hide
Query:  PDKDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIA
        PDKDT  EE LD SFEASNRIKL                               LEKRLQDQFKVRTALEKALGYRSSSQDN ND+EVPKPATELIREIA
Subjt:  PDKDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIA

Query:  VLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLS
        VLE+EV+HLEQYLLSLYRKAFDG I+STSPSTN+EKLKSPMNSPTAKYIVN    I SKKEDKAVQSGYDAFG PVKEYSGIGEDNLLDSSVRRCHSSLS
Subjt:  VLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLS

Query:  HYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISA
        HYSVFSKR S PEDSLG+ VRPCLSQP+SMMEFA NASSNL SLAEYLGTQI+DHIP+SANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISA
Subjt:  HYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISA

Query:  FSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHE
        FSPQDQFDML    +N+P+FDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+                    + F +LISKLEEVD RKLNHE
Subjt:  FSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHE

Query:  EKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALC
        EKLAFWINVHNSLMMHAYLAYGIPQNN+KK FLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSL+LPSKSKFKNGD+RQAYKIHRPEPLLHFALC
Subjt:  EKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALC

Query:  TGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRY
        TGCHSDPAVRVYTPKRV+ ELE AK EY+RATFGIRKD+KV+LPKII+SFAKE RLCTTG MEMIQ+SLPESLRK++TK QNGKSRKNVEW+PHNF+FRY
Subjt:  TGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRY

Query:  LISREMV
        LISRE+V
Subjt:  LISREMV

A0A6J1BUR1 uncharacterized protein LOC111005708 isoform X22.7e-27581.71Show/hide
Query:  PDKDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIA
        PDKDT  EE LD SFEASNRIKL                               LEKRLQDQFKVRTALEKALGYRSSSQDN ND+EVPKPATELIREIA
Subjt:  PDKDTFEEEQLDVSFEASNRIKL-------------------------------LEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIA

Query:  VLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLS
        VLE+EV+HLEQYLLSLYRKAFDG I+STSPSTN+EKLKSPMNSPTAKYIVN    I SKKEDKAVQSGYDAFG PVKEYSGIGEDNLLDSSVRRCHSSLS
Subjt:  VLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVN---DIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLS

Query:  HYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISA
        HYSVFSKR S PEDSLG+ VRPCLSQP+SMMEFA NASSNL SLAEYLGTQI+DHIP+SANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISA
Subjt:  HYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISA

Query:  FSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHE
        FSPQDQFDML    +N+P+FDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIE+                    + F +LISKLEEVD RKLNHE
Subjt:  FSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHE

Query:  EKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALC
        EKLAFWINVHNSLMMHAYLAYGIPQNN+KK FLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSL+LPSKSKFKNGD+RQAYKIHRPEPLLHFALC
Subjt:  EKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALC

Query:  TGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRY
        TGCHSDPAVRVYTPKRV+ ELE AK EY+RATFGIRKD+KV+LPKII+SFAKE RLCTTG MEMIQ+SLPESLRK++TK QNGKSRKNVEW+PHNF+FRY
Subjt:  TGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRY

Query:  LISREMV
        LISRE+V
Subjt:  LISREMV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G23700.1 Protein of unknown function, DUF5472.4e-14647.35Show/hide
Query:  DVSFEAS--NRIKLLEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTN-------
        DV F++S    I+ LEKRLQ+QF VR ALEKALGY++ S+D   D   PKP TELI+EIAVLELEV+HLEQYLLSLYRKAFD   +S SP T+       
Subjt:  DVSFEAS--NRIKLLEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTN-------

Query:  -----------------------DEKLKSP------MNSPTAKYIVNDIASK------------------------------------------------
                               D +LKSP      + SP  +     +A++                                                
Subjt:  -----------------------DEKLKSP------MNSPTAKYIVNDIASK------------------------------------------------

Query:  -KEDKAVQSGYDAFGDPVKE-YSGIGE---------------DNLL-----------DSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMM
         +ED  + S   +F + VKE  SG+ +               DN L           DS VRRC SSL+  S F+ R S PEDS    V  C SQPLS+ 
Subjt:  -KEDKAVQSGYDAFGDPVKE-YSGIGE---------------DNLL-----------DSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMM

Query:  EFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVE
        E+ QN  SN  SLAE++GT+ISDHI  + N+LSE+MIKC SAIY KL+DPP  +HG SSP SS S  S FSPQDQ+DM SP FR N SF    D+ F   
Subjt:  EFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPPLSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVE

Query:  GLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKV
           EFSGPYS+M+EV  I+R ++K  ++    R                 F  L+ +LE VDPRKL H+EKLAFWINVHN+L+MH +LA GIPQNN K+ 
Subjt:  GLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKV

Query:  FLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRA
         LL K AY IGG  +S++ IQS IL  +MPRPGQWL LLL  K KF+ GD+ Q Y +   EPLL+FALC+G HSDPA+RV+TPK + QELE AKEEY+RA
Subjt:  FLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRA

Query:  TFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKN-VEWIPHNFTFRYLISREMVK
        TFG++KD+K++LPKIIESF+K+  L    +MEMIQE LPE+++K+I K  +G+SRK+ VEW PHNF FRYLI+RE+V+
Subjt:  TFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKN-VEWIPHNFTFRYLISREMVK

AT5G66600.1 Protein of unknown function, DUF5472.9e-16854.16Show/hide
Query:  FPDKDTFE-EEQLDVSFEASNRIKL---------------------------LEKRLQDQFKVRTALEKALGYRSSSQ---DNTNDMEVPKPATELIREI
        FP+K   E ++  + S EAS R+KL                           LE RLQDQFKVR ALEKALGYR++S      TND+ +PKPAT+LI+++
Subjt:  FPDKDTFE-EEQLDVSFEASNRIKL---------------------------LEKRLQDQFKVRTALEKALGYRSSSQ---DNTNDMEVPKPATELIREI

Query:  AVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKS-PMNSPTAKYIV---NDIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSS
        AVLE+EV HLEQYLLSLYRKAF+  I+S SP+  ++K KS P+ +P  +      +D  SK +   V    D      K      + + +D S RR H  
Subjt:  AVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKS-PMNSPTAKYIV---NDIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSS

Query:  LSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPP-LSHHGLSSPVSSSSP
         S  S F  R + PEDS G+  R C SQPL    + QN   NL+SLAE+LGT+ISDH+P++ N+LSE M+KC+S IYCKL++PP + H GLSSP SS S 
Subjt:  LSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPP-LSHHGLSSPVSSSSP

Query:  ISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKL
         SAFSP DQ+D  SPGF N+ SFD+ LDN FHVEG K+FSGPYS++VEV  IYRD++K  E+                    + F +LIS+LEEVDPRKL
Subjt:  ISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKL

Query:  NHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHF
         HEEKLAFWINVHN+L+MHA+LAYGIPQNN+K+V LLLKAAYNIGGHTIS + IQSSILGC+M  PGQWL LL  S+ KFK GD+R AY I  PEPLLHF
Subjt:  NHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHF

Query:  ALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQN--GKSRKNVEWIPHN
        AL +G HSDPAVRVYTPKR+ QELE +KEEY+R    IRK R ++LPK++E+FAK+  LC  G+ EM+  S+PES RK + +CQ+   K RK ++WIPH+
Subjt:  ALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQN--GKSRKNVEWIPHN

Query:  FTFRYLISREMVK
        FTFRYLI RE  K
Subjt:  FTFRYLISREMVK

AT5G66600.2 Protein of unknown function, DUF5472.9e-16854.16Show/hide
Query:  FPDKDTFE-EEQLDVSFEASNRIKL---------------------------LEKRLQDQFKVRTALEKALGYRSSSQ---DNTNDMEVPKPATELIREI
        FP+K   E ++  + S EAS R+KL                           LE RLQDQFKVR ALEKALGYR++S      TND+ +PKPAT+LI+++
Subjt:  FPDKDTFE-EEQLDVSFEASNRIKL---------------------------LEKRLQDQFKVRTALEKALGYRSSSQ---DNTNDMEVPKPATELIREI

Query:  AVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKS-PMNSPTAKYIV---NDIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSS
        AVLE+EV HLEQYLLSLYRKAF+  I+S SP+  ++K KS P+ +P  +      +D  SK +   V    D      K      + + +D S RR H  
Subjt:  AVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKS-PMNSPTAKYIV---NDIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSS

Query:  LSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPP-LSHHGLSSPVSSSSP
         S  S F  R + PEDS G+  R C SQPL    + QN   NL+SLAE+LGT+ISDH+P++ N+LSE M+KC+S IYCKL++PP + H GLSSP SS S 
Subjt:  LSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPP-LSHHGLSSPVSSSSP

Query:  ISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKL
         SAFSP DQ+D  SPGF N+ SFD+ LDN FHVEG K+FSGPYS++VEV  IYRD++K  E+                    + F +LIS+LEEVDPRKL
Subjt:  ISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKL

Query:  NHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHF
         HEEKLAFWINVHN+L+MHA+LAYGIPQNN+K+V LLLKAAYNIGGHTIS + IQSSILGC+M  PGQWL LL  S+ KFK GD+R AY I  PEPLLHF
Subjt:  NHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHF

Query:  ALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQN--GKSRKNVEWIPHN
        AL +G HSDPAVRVYTPKR+ QELE +KEEY+R    IRK R ++LPK++E+FAK+  LC  G+ EM+  S+PES RK + +CQ+   K RK ++WIPH+
Subjt:  ALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQN--GKSRKNVEWIPHN

Query:  FTFRYLISREMVK
        FTFRYLI RE  K
Subjt:  FTFRYLISREMVK

AT5G66600.3 Protein of unknown function, DUF5472.9e-16854.16Show/hide
Query:  FPDKDTFE-EEQLDVSFEASNRIKL---------------------------LEKRLQDQFKVRTALEKALGYRSSSQ---DNTNDMEVPKPATELIREI
        FP+K   E ++  + S EAS R+KL                           LE RLQDQFKVR ALEKALGYR++S      TND+ +PKPAT+LI+++
Subjt:  FPDKDTFE-EEQLDVSFEASNRIKL---------------------------LEKRLQDQFKVRTALEKALGYRSSSQ---DNTNDMEVPKPATELIREI

Query:  AVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKS-PMNSPTAKYIV---NDIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSS
        AVLE+EV HLEQYLLSLYRKAF+  I+S SP+  ++K KS P+ +P  +      +D  SK +   V    D      K      + + +D S RR H  
Subjt:  AVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKS-PMNSPTAKYIV---NDIASKKEDKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSS

Query:  LSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPP-LSHHGLSSPVSSSSP
         S  S F  R + PEDS G+  R C SQPL    + QN   NL+SLAE+LGT+ISDH+P++ N+LSE M+KC+S IYCKL++PP + H GLSSP SS S 
Subjt:  LSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYCKLSDPP-LSHHGLSSPVSSSSP

Query:  ISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKL
         SAFSP DQ+D  SPGF N+ SFD+ LDN FHVEG K+FSGPYS++VEV  IYRD++K  E+                    + F +LIS+LEEVDPRKL
Subjt:  ISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKL

Query:  NHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHF
         HEEKLAFWINVHN+L+MHA+LAYGIPQNN+K+V LLLKAAYNIGGHTIS + IQSSILGC+M  PGQWL LL  S+ KFK GD+R AY I  PEPLLHF
Subjt:  NHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHF

Query:  ALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQN--GKSRKNVEWIPHN
        AL +G HSDPAVRVYTPKR+ QELE +KEEY+R    IRK R ++LPK++E+FAK+  LC  G+ EM+  S+PES RK + +CQ+   K RK ++WIPH+
Subjt:  ALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQN--GKSRKNVEWIPHN

Query:  FTFRYLISREMVK
        FTFRYLI RE  K
Subjt:  FTFRYLISREMVK

AT5G66600.4 Protein of unknown function, DUF5476.4e-16852Show/hide
Query:  GFGNLWQLAHLLPSSSSASSDLQSLYCFSLSTFSIIGFPDKDTFE-EEQLDVSFEASNRIKL---------------------------LEKRLQDQFKV
        G G +  L  +     S  S + +++C      +   FP+K   E ++  + S EAS R+KL                           LE RLQDQFKV
Subjt:  GFGNLWQLAHLLPSSSSASSDLQSLYCFSLSTFSIIGFPDKDTFE-EEQLDVSFEASNRIKL---------------------------LEKRLQDQFKV

Query:  RTALEKALGYRSSSQ---DNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKS-PMNSPTAKYIV---NDIASKKE
        R ALEKALGYR++S      TND+ +PKPAT+LI+++AVLE+EV HLEQYLLSLYRKAF+  I+S SP+  ++K KS P+ +P  +      +D  SK +
Subjt:  RTALEKALGYRSSSQ---DNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKS-PMNSPTAKYIV---NDIASKKE

Query:  DKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSAN
           V    D      K      + + +D S RR H   S  S F  R + PEDS G+  R C SQPL    + QN   NL+SLAE+LGT+ISDH+P++ N
Subjt:  DKAVQSGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSAN

Query:  RLSEDMIKCISAIYCKLSDPP-LSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMS
        +LSE M+KC+S IYCKL++PP + H GLSSP SS S  SAFSP DQ+D  SPGF N+ SFD+ LDN FHVEG K+FSGPYS++VEV  IYRD++K  E+ 
Subjt:  RLSEDMIKCISAIYCKLSDPP-LSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMS

Query:  TFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRM
                           + F +LIS+LEEVDPRKL HEEKLAFWINVHN+L+MHA+LAYGIPQNN+K+V LLLKAAYNIGGHTIS + IQSSILGC+M
Subjt:  TFTRFQVGNIWMFICHVNWKIFLTLISKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRM

Query:  PRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTG
          PGQWL LL  S+ KFK GD+R AY I  PEPLLHFAL +G HSDPAVRVYTPKR+ QELE +KEEY+R    IRK R ++LPK++E+FAK+  LC  G
Subjt:  PRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLHFALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTG

Query:  MMEMIQESLPESLRKSITKCQN--GKSRKNVEWIPHNFTFRYLISREMVK
        + EM+  S+PES RK + +CQ+   K RK ++WIPH+FTFRYLI RE  K
Subjt:  MMEMIQESLPESLRKSITKCQN--GKSRKNVEWIPHNFTFRYLISREMVK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTGGTTCTCAGATATGCTGTTTCAAGAGGCAGAAGACGTTGCAAAAATGCTGCAAAACAGAGATACTCCACGACACAACCGTTGCAAGAGGTAGCTTCTGATGC
TGCTGAGTTGGTTGAAGAGTTGAAAAGCTTATCTGAGGTGGGTTTTGGAAACTTGTGGCAATTAGCTCACCTGTTGCCTTCCTCCTCTTCTGCTTCTTCAGATCTGCAAT
CTTTATACTGTTTCAGTCTTTCCACCTTTTCTATTATTGGCTTTCCAGACAAGGATACTTTTGAGGAAGAACAGTTGGATGTCTCCTTTGAAGCATCGAATCGAATAAAG
CTGCTTGAAAAGAGATTGCAAGACCAGTTCAAGGTTCGCACGGCACTAGAAAAAGCATTAGGATATAGGTCCTCTTCTCAAGACAACACAAACGACATGGAAGTACCCAA
GCCTGCCACCGAATTGATCAGAGAGATTGCGGTATTAGAACTGGAAGTTGCACATTTAGAACAATATCTTCTCTCTTTGTATCGAAAAGCATTTGATGGACACATAGCTT
CTACATCTCCATCTACCAACGATGAAAAACTAAAATCGCCTATGAACTCCCCTACAGCCAAGTATATAGTGAATGACATTGCATCAAAAAAGGAAGATAAAGCTGTTCAG
TCTGGTTATGATGCTTTTGGAGATCCAGTAAAGGAATATAGCGGAATTGGCGAAGACAATCTGTTGGACTCCAGCGTCCGTCGTTGTCACTCATCTCTATCACATTATTC
AGTTTTCTCAAAGAGAGCTTCTTTTCCGGAAGATTCTTTGGGTAGGGATGTACGGCCTTGTCTTTCCCAACCACTGTCCATGATGGAGTTTGCCCAGAATGCTTCATCCA
ATTTAGTTAGTCTAGCAGAGTACCTTGGTACTCAAATATCAGACCATATTCCACAGAGTGCCAACCGTCTTTCTGAAGATATGATCAAGTGCATATCAGCCATATATTGC
AAACTTTCAGACCCTCCTTTATCACATCACGGTCTTTCCTCTCCAGTTTCCTCATCATCGCCAATCAGTGCATTCTCTCCACAAGATCAGTTTGACATGTTGAGTCCTGG
GTTCAGAAATAATCCATCTTTCGATATACTACTGGATAATCCTTTTCACGTGGAAGGACTTAAAGAGTTTAGCGGCCCGTACAGCACAATGGTTGAAGTGCCATGGATTT
ATAGGGATAGTCAGAAATTGATCGAAATGAGCACTTTTACAAGATTTCAGGTCGGCAACATATGGATGTTCATATGTCATGTTAACTGGAAGATTTTTTTGACGCTTATA
TCCAAGTTAGAGGAAGTAGATCCAAGGAAGTTGAATCATGAGGAAAAGCTCGCCTTCTGGATCAACGTACACAATTCTCTGATGATGCATGCATACTTGGCTTATGGTAT
ACCACAAAACAATATGAAGAAGGTCTTTCTTCTCTTGAAGGCTGCATATAACATAGGTGGTCACACAATCAGTGTAGACACCATACAAAGTTCGATACTTGGATGTCGGA
TGCCTCGGCCTGGACAGTGGCTCTCTCTTCTTCTTCCCTCGAAGTCAAAATTCAAAAATGGAGATAAACGGCAAGCCTACAAAATCCACCGGCCAGAACCCCTCTTACAT
TTCGCACTTTGTACAGGCTGCCATTCTGATCCTGCGGTTCGTGTATACACACCCAAAAGAGTTCTGCAGGAGCTAGAGATAGCAAAAGAAGAATACGTTCGAGCCACATT
CGGTATACGCAAGGACCGAAAAGTCATTCTACCCAAGATCATCGAGTCATTTGCAAAAGAATTAAGATTGTGCACAACTGGTATGATGGAAATGATCCAAGAGTCTTTGC
CTGAATCTCTTAGGAAGAGCATTACAAAATGCCAAAACGGGAAGTCCCGCAAGAACGTTGAATGGATACCGCACAACTTCACTTTTCGGTATCTGATATCTAGAGAAATG
GTGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCTGGTTCTCAGATATGCTGTTTCAAGAGGCAGAAGACGTTGCAAAAATGCTGCAAAACAGAGATACTCCACGACACAACCGTTGCAAGAGGTAGCTTCTGATGC
TGCTGAGTTGGTTGAAGAGTTGAAAAGCTTATCTGAGGTGGGTTTTGGAAACTTGTGGCAATTAGCTCACCTGTTGCCTTCCTCCTCTTCTGCTTCTTCAGATCTGCAAT
CTTTATACTGTTTCAGTCTTTCCACCTTTTCTATTATTGGCTTTCCAGACAAGGATACTTTTGAGGAAGAACAGTTGGATGTCTCCTTTGAAGCATCGAATCGAATAAAG
CTGCTTGAAAAGAGATTGCAAGACCAGTTCAAGGTTCGCACGGCACTAGAAAAAGCATTAGGATATAGGTCCTCTTCTCAAGACAACACAAACGACATGGAAGTACCCAA
GCCTGCCACCGAATTGATCAGAGAGATTGCGGTATTAGAACTGGAAGTTGCACATTTAGAACAATATCTTCTCTCTTTGTATCGAAAAGCATTTGATGGACACATAGCTT
CTACATCTCCATCTACCAACGATGAAAAACTAAAATCGCCTATGAACTCCCCTACAGCCAAGTATATAGTGAATGACATTGCATCAAAAAAGGAAGATAAAGCTGTTCAG
TCTGGTTATGATGCTTTTGGAGATCCAGTAAAGGAATATAGCGGAATTGGCGAAGACAATCTGTTGGACTCCAGCGTCCGTCGTTGTCACTCATCTCTATCACATTATTC
AGTTTTCTCAAAGAGAGCTTCTTTTCCGGAAGATTCTTTGGGTAGGGATGTACGGCCTTGTCTTTCCCAACCACTGTCCATGATGGAGTTTGCCCAGAATGCTTCATCCA
ATTTAGTTAGTCTAGCAGAGTACCTTGGTACTCAAATATCAGACCATATTCCACAGAGTGCCAACCGTCTTTCTGAAGATATGATCAAGTGCATATCAGCCATATATTGC
AAACTTTCAGACCCTCCTTTATCACATCACGGTCTTTCCTCTCCAGTTTCCTCATCATCGCCAATCAGTGCATTCTCTCCACAAGATCAGTTTGACATGTTGAGTCCTGG
GTTCAGAAATAATCCATCTTTCGATATACTACTGGATAATCCTTTTCACGTGGAAGGACTTAAAGAGTTTAGCGGCCCGTACAGCACAATGGTTGAAGTGCCATGGATTT
ATAGGGATAGTCAGAAATTGATCGAAATGAGCACTTTTACAAGATTTCAGGTCGGCAACATATGGATGTTCATATGTCATGTTAACTGGAAGATTTTTTTGACGCTTATA
TCCAAGTTAGAGGAAGTAGATCCAAGGAAGTTGAATCATGAGGAAAAGCTCGCCTTCTGGATCAACGTACACAATTCTCTGATGATGCATGCATACTTGGCTTATGGTAT
ACCACAAAACAATATGAAGAAGGTCTTTCTTCTCTTGAAGGCTGCATATAACATAGGTGGTCACACAATCAGTGTAGACACCATACAAAGTTCGATACTTGGATGTCGGA
TGCCTCGGCCTGGACAGTGGCTCTCTCTTCTTCTTCCCTCGAAGTCAAAATTCAAAAATGGAGATAAACGGCAAGCCTACAAAATCCACCGGCCAGAACCCCTCTTACAT
TTCGCACTTTGTACAGGCTGCCATTCTGATCCTGCGGTTCGTGTATACACACCCAAAAGAGTTCTGCAGGAGCTAGAGATAGCAAAAGAAGAATACGTTCGAGCCACATT
CGGTATACGCAAGGACCGAAAAGTCATTCTACCCAAGATCATCGAGTCATTTGCAAAAGAATTAAGATTGTGCACAACTGGTATGATGGAAATGATCCAAGAGTCTTTGC
CTGAATCTCTTAGGAAGAGCATTACAAAATGCCAAAACGGGAAGTCCCGCAAGAACGTTGAATGGATACCGCACAACTTCACTTTTCGGTATCTGATATCTAGAGAAATG
GTGAAGTGA
Protein sequenceShow/hide protein sequence
MELVLRYAVSRGRRRCKNAAKQRYSTTQPLQEVASDAAELVEELKSLSEVGFGNLWQLAHLLPSSSSASSDLQSLYCFSLSTFSIIGFPDKDTFEEEQLDVSFEASNRIK
LLEKRLQDQFKVRTALEKALGYRSSSQDNTNDMEVPKPATELIREIAVLELEVAHLEQYLLSLYRKAFDGHIASTSPSTNDEKLKSPMNSPTAKYIVNDIASKKEDKAVQ
SGYDAFGDPVKEYSGIGEDNLLDSSVRRCHSSLSHYSVFSKRASFPEDSLGRDVRPCLSQPLSMMEFAQNASSNLVSLAEYLGTQISDHIPQSANRLSEDMIKCISAIYC
KLSDPPLSHHGLSSPVSSSSPISAFSPQDQFDMLSPGFRNNPSFDILLDNPFHVEGLKEFSGPYSTMVEVPWIYRDSQKLIEMSTFTRFQVGNIWMFICHVNWKIFLTLI
SKLEEVDPRKLNHEEKLAFWINVHNSLMMHAYLAYGIPQNNMKKVFLLLKAAYNIGGHTISVDTIQSSILGCRMPRPGQWLSLLLPSKSKFKNGDKRQAYKIHRPEPLLH
FALCTGCHSDPAVRVYTPKRVLQELEIAKEEYVRATFGIRKDRKVILPKIIESFAKELRLCTTGMMEMIQESLPESLRKSITKCQNGKSRKNVEWIPHNFTFRYLISREM
VK