; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi08G000196 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi08G000196
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionProtein of unknown function, DUF547
Genome locationchr8:8091773..8102743
RNA-Seq ExpressionBhi08G000196
SyntenyBhi08G000196
Gene Ontology termsGO:0016853 - isomerase activity (molecular function)
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607152.1 hypothetical protein SDJN03_00494, partial [Cucurbita argyrosperma subsp. sororia]9.5e-17280.34Show/hide
Query:  IAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMS-NLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLK
        +A P PPHSPSQ++KKK+SGQRKKEELEREVLMLQKLL+QEEKVHEIL+G+  QQNGS L +S N LPPKVKE+LAELAMVESEI RLEIQIT+LQKDLK
Subjt:  IAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMS-NLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLK

Query:  TEQQHTTTKSKQWSCEQPQTNNN--NKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLV
        +E++   T+SK WS EQP   NN  NKPP+ WNPIS+ TFDTK LHFISKAIKGDYALNHF+LD AKN +    D K  H    E+KL ERV RKSGLLV
Subjt:  TEQQHTTTKSKQWSCEQPQTNNN--NKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLV

Query:  ASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSS
          SPLR+P+HPSPK+RERS L MPP K + MPIQ EENIQNWHPNKLSESI+KCLNF+YVRLLR SRTMELEKSGPISRSLH SSLSSRSFRVENGLNS 
Subjt:  ASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSS

Query:  LSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPS
        LS+HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSI+SATFIPLI KLRVLMSNLQ VDL+PL+YQQKLAFWINMYNACIMNGFLQYGVPS
Subjt:  LSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPS

Query:  SPEKLATLMNKA
        SPEKLA L+NKA
Subjt:  SPEKLATLMNKA

XP_004139551.1 uncharacterized protein LOC101221529 [Cucumis sativus]5.9e-19889.4Show/hide
Query:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD
        MAHI  PPPPHSPSQFLKKK+SGQRKKEELEREVLMLQKLLNQEEK+HEILEGV+KQQNGSA+G+SNLLPPKVKE+LAELAMVESEIARLEIQITQLQKD
Subjt:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD

Query:  LKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYA-LN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSG
        LK EQQ  TTKSKQWS EQ    NNNKPP+ WNPIS+ TFDTKALHFISKAIKGDYA LN HFKLD +KN+E  P D KD+HH L EVKLHER VSRKSG
Subjt:  LKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYA-LN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSG

Query:  LLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL
        LLVASSPLRDPRHPSPKQRER+ LD+P PKS+PM  QAEENIQNWHPNKLSESIMKCLNF+YVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL
Subjt:  LLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL

Query:  NSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYG
        NSSLS HKELRQQDPYGIFENEES+PRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYG
Subjt:  NSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYG

Query:  VPSSPEKLATLMNKA
        VPSSPEKLATLMNKA
Subjt:  VPSSPEKLATLMNKA

XP_008462917.1 PREDICTED: uncharacterized protein LOC103501181 isoform X1 [Cucumis melo]8.8e-20291.08Show/hide
Query:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD
        MAHI   PPPHSPSQFLKKK+SGQRKKEELEREVLMLQKLLNQEEKVHEILEG+NKQQNGSA+G+SNLLPPKVKE+LAELAMVESEIARLEIQITQL+KD
Subjt:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD

Query:  LKTEQQHTTTKSKQWSCE-QPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSG
        LK EQQH TTKSKQWS E QPQTNNNNKPP+ WNPIS+ TFDTKALHFISKAIKGDYALN HFKLDN+KN+E  P D KD+HH L EVKLHER VSRKSG
Subjt:  LKTEQQHTTTKSKQWSCE-QPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSG

Query:  LLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL
        LLVASSPLRDPRHPSPKQRER+ LD+P PKSMPM  QAEENIQNWHPNKLSESIMKCLNF+YVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL
Subjt:  LLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL

Query:  NSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYG
        NSSLS HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYG
Subjt:  NSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYG

Query:  VPSSPEKLATLMNKA
        VPSSPEKLATLMNKA
Subjt:  VPSSPEKLATLMNKA

XP_008462925.1 PREDICTED: uncharacterized protein LOC103501181 isoform X2 [Cucumis melo]8.8e-18691.32Show/hide
Query:  MLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWSCE-QPQTNNNNKPPMGWNP
        MLQKLLNQEEKVHEILEG+NKQQNGSA+G+SNLLPPKVKE+LAELAMVESEIARLEIQITQL+KDLK EQQH TTKSKQWS E QPQTNNNNKPP+ WNP
Subjt:  MLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWSCE-QPQTNNNNKPPMGWNP

Query:  ISRATFDTKALHFISKAIKGDYALN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMP
        IS+ TFDTKALHFISKAIKGDYALN HFKLDN+KN+E  P D KD+HH L EVKLHER VSRKSGLLVASSPLRDPRHPSPKQRER+ LD+P PKSMPM 
Subjt:  ISRATFDTKALHFISKAIKGDYALN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMP

Query:  IQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNL
         QAEENIQNWHPNKLSESIMKCLNF+YVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEESIPRDIGPYKNL
Subjt:  IQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNL

Query:  VIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA
        VIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA
Subjt:  VIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA

XP_038894153.1 uncharacterized protein LOC120082868 [Benincasa hispida]4.2e-228100Show/hide
Query:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD
        MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD
Subjt:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD

Query:  LKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLV
        LKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLV
Subjt:  LKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLV

Query:  ASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSS
        ASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSS
Subjt:  ASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSS

Query:  LSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPS
        LSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPS
Subjt:  LSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPS

Query:  SPEKLATLMNKA
        SPEKLATLMNKA
Subjt:  SPEKLATLMNKA

TrEMBL top hitse value%identityAlignment
A0A0A0LSP4 Uncharacterized protein1.2e-18588.83Show/hide
Query:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD
        MAHI  PPPPHSPSQFLKKK+SGQRKKEELEREVLMLQKLLNQEEK+HEILEGV+KQQNGSA+G+SNLLPPKVKE+LAELAMVESEIARLEIQITQLQKD
Subjt:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD

Query:  LKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYA-LN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSG
        LK EQQ  TTKSKQWS EQ    NNNKPP+ WNPIS+ TFDTKALHFISKAIKGDYA LN HFKLD +KN+E  P D KD+HH L EVKLHER VSRKSG
Subjt:  LKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYA-LN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSG

Query:  LLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL
        LLVASSPLRDPRHPSPKQRER+ LD+P PKS+PM  QAEENIQNWHPNKLSESIMKCLNF+YVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL
Subjt:  LLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL

Query:  NSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMN
        NSSLS HKELRQQDPYGIFENEES+PRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMN
Subjt:  NSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMN

A0A1S3CHZ3 uncharacterized protein LOC103501181 isoform X14.2e-20291.08Show/hide
Query:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD
        MAHI   PPPHSPSQFLKKK+SGQRKKEELEREVLMLQKLLNQEEKVHEILEG+NKQQNGSA+G+SNLLPPKVKE+LAELAMVESEIARLEIQITQL+KD
Subjt:  MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD

Query:  LKTEQQHTTTKSKQWSCE-QPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSG
        LK EQQH TTKSKQWS E QPQTNNNNKPP+ WNPIS+ TFDTKALHFISKAIKGDYALN HFKLDN+KN+E  P D KD+HH L EVKLHER VSRKSG
Subjt:  LKTEQQHTTTKSKQWSCE-QPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSG

Query:  LLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL
        LLVASSPLRDPRHPSPKQRER+ LD+P PKSMPM  QAEENIQNWHPNKLSESIMKCLNF+YVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL
Subjt:  LLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGL

Query:  NSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYG
        NSSLS HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYG
Subjt:  NSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYG

Query:  VPSSPEKLATLMNKA
        VPSSPEKLATLMNKA
Subjt:  VPSSPEKLATLMNKA

A0A1S3CJL7 uncharacterized protein LOC103501181 isoform X24.3e-18691.32Show/hide
Query:  MLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWSCE-QPQTNNNNKPPMGWNP
        MLQKLLNQEEKVHEILEG+NKQQNGSA+G+SNLLPPKVKE+LAELAMVESEIARLEIQITQL+KDLK EQQH TTKSKQWS E QPQTNNNNKPP+ WNP
Subjt:  MLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWSCE-QPQTNNNNKPPMGWNP

Query:  ISRATFDTKALHFISKAIKGDYALN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMP
        IS+ TFDTKALHFISKAIKGDYALN HFKLDN+KN+E  P D KD+HH L EVKLHER VSRKSGLLVASSPLRDPRHPSPKQRER+ LD+P PKSMPM 
Subjt:  ISRATFDTKALHFISKAIKGDYALN-HFKLDNAKNSESGPTDTKDNHHLLPEVKLHER-VSRKSGLLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMP

Query:  IQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNL
         QAEENIQNWHPNKLSESIMKCLNF+YVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLS HKELRQQDPYGIFENEESIPRDIGPYKNL
Subjt:  IQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNL

Query:  VIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA
        VIFTSTSMDPKSISSATFIPL+RKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA
Subjt:  VIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA

A0A6J1G939 uncharacterized protein LOC111452055 isoform X11.1e-16879.37Show/hide
Query:  IAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMS-NLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLK
        +A P PPHSPSQ++KKK+SGQRKKEELEREVLMLQKLL+QEEKVHEIL+G+  QQN S L +S N LPPKVKE+LAELAMVESEI RLEIQIT+LQKDLK
Subjt:  IAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMS-NLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLK

Query:  TEQQHTTTKSKQWSCEQPQTNNN--NKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLV
        +E+Q   T+SK WS EQP   NN   KPP+ WNPIS+ TFDTK LHFISKAIKGDYALN F+LD AKN +    D K  H    E+KL ERV RKSGLLV
Subjt:  TEQQHTTTKSKQWSCEQPQTNNN--NKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLV

Query:  ASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSS
          SPLR+P+HPSPK+RERS L MPP K + MPIQ EENIQNWHPNKLSESI+KCLNF+YVRLLR SRTMELEKSGPISRSLH SSLSSRSFRVENGLNS 
Subjt:  ASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSS

Query:  LSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPS
        LS+HKELRQQDPY IFENEESIPRDIGPYKNLVIFTSTSMDPKSI+SATFIPLI KLRVLMSNLQ VDL+PL+YQQKLAFWINMYNACIMNGFL YGVPS
Subjt:  LSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPS

Query:  SPEKLATLMNKA
        SPEKLA L+NKA
Subjt:  SPEKLATLMNKA

A0A6J1KFI3 uncharacterized protein LOC1114933972.2e-16678.64Show/hide
Query:  IAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMS-NLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLK
        +A P PPHSPSQ++KKK+SGQRKKEELEREVLMLQKLL+QEEKVHEIL G+  QQN S L +S N LPPKVKE+L ELAMVESEI RLEIQI +LQKDLK
Subjt:  IAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMS-NLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLK

Query:  TEQQHTTTKSKQWSCEQPQTNNN--NKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLV
        +E+Q   T+SK WS EQ    NN  NKPP+ WNPIS+ TFDTK LHFISKAIKGDYALNHF+LD AKN +    D K       E+KL ERV RKSGLLV
Subjt:  TEQQHTTTKSKQWSCEQPQTNNN--NKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLV

Query:  ASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSS
          SPLR+P+HPSPK+RERS L MP  K + MPI  EENIQNWHPNKLSESI+KCLNF+YVRLLR SRTMELEKSGPISRSLH SSLSSRSFRVENGLNS 
Subjt:  ASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSS

Query:  LSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPS
        LS+HKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSI+SATFIPLI KLRVLM+NLQ VDL+PL+YQQKLAFWINMYNACIMNGFLQYGVPS
Subjt:  LSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPS

Query:  SPEKLATLMNKA
        SPEKLA L+NKA
Subjt:  SPEKLATLMNKA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G39690.1 Protein of unknown function, DUF5471.5e-2628.57Show/hide
Query:  LQKLLNQEEKVHEILEGVNKQQNGSALGMSNL-LPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWNPI
        L++ L +EE V   L         S   +S+L LPP+  E++ ELA+VE+EI  L+ +I +L+  L +EQ+ T     Q + ++      +       P+
Subjt:  LQKLLNQEEKVHEILEGVNKQQNGSALGMSNL-LPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWNPI

Query:  SRATFDTKALHFISKAIKGDYALNHFKLDNA-------KNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERSSLDMPPPKS
                  H   ++     +  H +L  +         S  G TD  D       V   +    + GL +  +  +D                     
Subjt:  SRATFDTKALHFISKAIKGDYALNHFKLDNA-------KNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERSSLDMPPPKS

Query:  MPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF-ENEESIPRDIG
                       PN++SE ++ CL  +Y+ L   S     +  G +S S   SS S +S        ++ S ++     DPY +  ++   + RDIG
Subjt:  MPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF-ENEESIPRDIG

Query:  PYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA
        PYKN +  + +S+D    +     P + +L VLM  L +VDL  L+Y+QKLAFWIN+YNACIM+ FL+YG+PSS  +L TLMNKA
Subjt:  PYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA

AT3G12540.1 Protein of unknown function, DUF5471.8e-2729.82Show/hide
Query:  LMLQKLLNQEEKVHEILEGVNKQQNGSALGMS-NLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWN
        + +Q  L  E+ +++ L  +++    S   +S  LLPP+V+E++ ELA VE+EI  LE +I  L+ D+ +E++    K  + S ++ +      P     
Subjt:  LMLQKLLNQEEKVHEILEGVNKQQNGSALGMS-NLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWSCEQPQTNNNNKPPMGWN

Query:  PISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPI
                 K L      +  D   +  K+ +    +   + +  +HH++ +++++   S ++   + S+     R  S    +  S             
Subjt:  PISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERSSLDMPPPKSMPMPI

Query:  QAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLV
        Q + N+Q   PN +SE ++KCL  +Y+ L R+SR  E E S  +S+ L  + L + SF+ ++  + + S        DPYG         RDIG YKN +
Subjt:  QAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYKNLV

Query:  IFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA
          T TS+D   +S  +    +  LRVL   L KVDL  L++++K+AFWIN YNAC+MNGFL++G+PSS EKL T++  A
Subjt:  IFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA

AT5G42690.1 Protein of unknown function, DUF5473.4e-2627.27Show/hide
Query:  LKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWS
        + +K   + K   L+ +V  L+K L  EE +H  +E    +  G+   +   LPP V E+LAE+A++E E+ RLE  I   +++L  E   T++  +   
Subjt:  LKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWS

Query:  CEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLVASSPLRDPRHPSPKQR
        C           P  W   S++   T A    S   +   +++  +    K  E+  + T     +      H ++++     + +  L+D         
Subjt:  CEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLVASSPLRDPRHPSPKQR

Query:  ERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF
        ERSS                 +     PNK+SE ++KCL+ +++R+    R+M                           +  S    K+   +DPYGI 
Subjt:  ERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF

Query:  ENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA
         +     RDIG YKN       S++    SS++   LIR+L+ L+  L  V+++ L+ Q+KLAFWIN+YN+C+MNGFL++G+P SP+ + TLM KA
Subjt:  ENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA

AT5G42690.2 Protein of unknown function, DUF5475.8e-2627.53Show/hide
Query:  LKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWS
        + +K   + K   L+ +V  L+K L  EE +H  +E    +  G+   +   LPP V E+LAE+A++E E+ RLE  I   +++L  E   T++  +   
Subjt:  LKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTTKSKQWS

Query:  CEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLVASSPLRDPRHPSPKQR
        C               +P     + TK     SK+       +   L  A  S S     K+N      +K   + +      +A + L        K  
Subjt:  CEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLVASSPLRDPRHPSPKQR

Query:  ERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF
        E   L     +      +   +     PNK+SE ++KCL+ +++R+    R+M                           +  S    K+   +DPYGI 
Subjt:  ERSSLDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIF

Query:  ENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA
         +     RDIG YKN       S++    SS++   LIR+L+ L+  L  V+++ L+ Q+KLAFWIN+YN+C+MNGFL++G+P SP+ + TLM KA
Subjt:  ENEESIPRDIGPYKNLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA

AT5G60720.1 Protein of unknown function, DUF5479.6e-9849.36Show/hide
Query:  IAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQ---NGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD
        I+P   PHS      KK +GQ+KKEE+E+EV ML+++L+QEEK  EILE V K Q   + S+L +   LPPK+KE++ EL++VE EI+RLEIQI+ LQ +
Subjt:  IAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQ---NGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKD

Query:  LKTEQ-----QHTTTKSKQ-WSCEQ-------------------------PQTN-------NNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFK
        LK EQ     Q TT+ S++ W   +                         P  N       NNN      +    ATF TK LHFI+KAIKGDYA+  F+
Subjt:  LKTEQ-----QHTTTKSKQ-WSCEQ-------------------------PQTN-------NNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFK

Query:  LDNAKNSESGPTDTKDNHHLLPEVKLHERVS-RKSGLLVASSPLRDPRHPSPKQRER-------SSLDMPPPKSMPMPIQAE--ENIQNWHPNKLSESIM
          N K    G  + +++  +  E K+ E +  +K   + + SPLR+PR+ SP +  +       +SLD+ PPKS+   I  E  +NIQ WHPNKL+E+IM
Subjt:  LDNAKNSESGPTDTKDNHHLLPEVKLHERVS-RKSGLLVASSPLRDPRHPSPKQRER-------SSLDMPPPKSMPMPIQAE--ENIQNWHPNKLSESIM

Query:  KCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNS-----SLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISS
        KCLNF+YVRLLR +R MELEK+GPISRS ++ SLSSRSFRV+N  +S     +L  +KE RQQDPYGIF+ E S+ RDIGPYKNLVIFTS+SMD K ISS
Subjt:  KCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNS-----SLSVHKELRQQDPYGIFENEESIPRDIGPYKNLVIFTSTSMDPKSISS

Query:  ATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLM-NKA
        ++ + LI+KLRVLM+NL+ VDL+ LS+QQKLAFWINM+NAC+M+G+LQ+GVP + E+L +L+ NKA
Subjt:  ATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLM-NKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCACATTGCTCCTCCTCCTCCTCCTCACTCTCCCTCCCAATTTCTGAAGAAGAAGATTAGTGGGCAAAGAAAGAAGGAGGAGCTTGAAAGAGAGGTGTTGATGCT
TCAAAAATTATTAAATCAGGAAGAAAAGGTGCATGAGATTTTAGAAGGTGTTAATAAACAGCAAAATGGTTCAGCACTTGGCATGTCAAATTTGCTTCCTCCCAAGGTAA
AGGAAGTGTTGGCAGAACTAGCAATGGTGGAAAGTGAAATAGCAAGGCTTGAGATTCAAATAACTCAACTCCAAAAGGACTTGAAAACTGAGCAACAACATACCACAACA
AAGTCCAAGCAATGGAGCTGTGAGCAACCTCAAACCAATAACAATAATAAACCACCAATGGGTTGGAACCCAATTAGCAGAGCAACTTTTGACACTAAGGCTCTTCACTT
CATTAGCAAAGCCATCAAGGGAGACTATGCTCTCAATCATTTCAAATTGGATAATGCAAAAAATAGTGAATCAGGTCCTACAGATACCAAAGACAATCATCATCTTCTTC
CTGAGGTTAAACTCCATGAAAGAGTTTCTAGAAAGAGTGGCCTTCTCGTCGCCTCGTCTCCATTGCGAGACCCCCGACATCCTTCTCCAAAGCAACGAGAGCGAAGTTCA
TTGGACATGCCACCACCAAAATCTATGCCAATGCCAATTCAAGCAGAAGAAAACATCCAAAATTGGCATCCTAACAAGCTATCAGAGAGTATCATGAAGTGCTTGAACTT
CGTATATGTGAGACTGCTGAGAGCCTCAAGAACAATGGAGCTAGAGAAGTCAGGTCCCATTTCAAGATCTTTGCATTACTCTTCCTTGAGCTCGAGAAGCTTCCGAGTCG
AGAACGGTTTAAACTCGAGCCTTTCAGTACACAAAGAACTGAGGCAACAAGATCCTTACGGCATCTTTGAAAACGAAGAATCGATACCGAGGGATATTGGCCCTTACAAG
AACTTGGTCATATTCACTTCAACTTCCATGGATCCCAAATCTATATCCAGTGCCACTTTCATCCCTCTCATAAGGAAGCTAAGGGTCTTGATGAGCAATCTGCAAAAAGT
GGATTTACGGCCATTGAGTTACCAACAAAAACTTGCATTTTGGATCAACATGTACAATGCTTGTATCATGAATGGATTTCTCCAATATGGAGTGCCTTCTTCTCCAGAAA
AACTAGCCACTTTGATGAATAAGGCAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCACATTGCTCCTCCTCCTCCTCCTCACTCTCCCTCCCAATTTCTGAAGAAGAAGATTAGTGGGCAAAGAAAGAAGGAGGAGCTTGAAAGAGAGGTGTTGATGCT
TCAAAAATTATTAAATCAGGAAGAAAAGGTGCATGAGATTTTAGAAGGTGTTAATAAACAGCAAAATGGTTCAGCACTTGGCATGTCAAATTTGCTTCCTCCCAAGGTAA
AGGAAGTGTTGGCAGAACTAGCAATGGTGGAAAGTGAAATAGCAAGGCTTGAGATTCAAATAACTCAACTCCAAAAGGACTTGAAAACTGAGCAACAACATACCACAACA
AAGTCCAAGCAATGGAGCTGTGAGCAACCTCAAACCAATAACAATAATAAACCACCAATGGGTTGGAACCCAATTAGCAGAGCAACTTTTGACACTAAGGCTCTTCACTT
CATTAGCAAAGCCATCAAGGGAGACTATGCTCTCAATCATTTCAAATTGGATAATGCAAAAAATAGTGAATCAGGTCCTACAGATACCAAAGACAATCATCATCTTCTTC
CTGAGGTTAAACTCCATGAAAGAGTTTCTAGAAAGAGTGGCCTTCTCGTCGCCTCGTCTCCATTGCGAGACCCCCGACATCCTTCTCCAAAGCAACGAGAGCGAAGTTCA
TTGGACATGCCACCACCAAAATCTATGCCAATGCCAATTCAAGCAGAAGAAAACATCCAAAATTGGCATCCTAACAAGCTATCAGAGAGTATCATGAAGTGCTTGAACTT
CGTATATGTGAGACTGCTGAGAGCCTCAAGAACAATGGAGCTAGAGAAGTCAGGTCCCATTTCAAGATCTTTGCATTACTCTTCCTTGAGCTCGAGAAGCTTCCGAGTCG
AGAACGGTTTAAACTCGAGCCTTTCAGTACACAAAGAACTGAGGCAACAAGATCCTTACGGCATCTTTGAAAACGAAGAATCGATACCGAGGGATATTGGCCCTTACAAG
AACTTGGTCATATTCACTTCAACTTCCATGGATCCCAAATCTATATCCAGTGCCACTTTCATCCCTCTCATAAGGAAGCTAAGGGTCTTGATGAGCAATCTGCAAAAAGT
GGATTTACGGCCATTGAGTTACCAACAAAAACTTGCATTTTGGATCAACATGTACAATGCTTGTATCATGAATGGATTTCTCCAATATGGAGTGCCTTCTTCTCCAGAAA
AACTAGCCACTTTGATGAATAAGGCAA
Protein sequenceShow/hide protein sequence
MAHIAPPPPPHSPSQFLKKKISGQRKKEELEREVLMLQKLLNQEEKVHEILEGVNKQQNGSALGMSNLLPPKVKEVLAELAMVESEIARLEIQITQLQKDLKTEQQHTTT
KSKQWSCEQPQTNNNNKPPMGWNPISRATFDTKALHFISKAIKGDYALNHFKLDNAKNSESGPTDTKDNHHLLPEVKLHERVSRKSGLLVASSPLRDPRHPSPKQRERSS
LDMPPPKSMPMPIQAEENIQNWHPNKLSESIMKCLNFVYVRLLRASRTMELEKSGPISRSLHYSSLSSRSFRVENGLNSSLSVHKELRQQDPYGIFENEESIPRDIGPYK
NLVIFTSTSMDPKSISSATFIPLIRKLRVLMSNLQKVDLRPLSYQQKLAFWINMYNACIMNGFLQYGVPSSPEKLATLMNKA