CuGenDBv2

ClCG05G001350 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G001350
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: protein BPS1, chloroplastic-like
Genome location: CG_Chr05:1404640..1407084
RNA-Seq Expression: ClCG05G001350
Synteny: ClCG05G001350
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004320 - Protein of unknown function DUF241, plant
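
The genome location string above can be split into its parts with a short script. This is a minimal sketch: `parse_location` is a hypothetical helper (not a CuGenDB API), and the span length assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


seqid, start, end = parse_location("CG_Chr05:1404640..1407084")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 2445 bp on CG_Chr05; with 0-based half-open coordinates the same string would give 2444.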


Homology
GenBank top hits (e value, % identity, alignment)
KAG7019020.1 Protein BPS1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 3.8e-159; identity 82.72%
Query:  CYTMSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWL
        CY MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA++L KLTPKS+ND+ SF WM LA+KLL E HNDVKTLI +L  PVSDW EKWL
Subjt:  CYTMSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWL

Query:  DEYLNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLY
        DEYL+ISVKLLDICNDFSS+LSQLNQGHLILRCALHNL STSSN FV ARSSLDAWNQHISSRTSRVE+ +PI+D  EE LDLPKVKNS KGKVLM V+Y
Subjt:  DEYLNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLY

Query:  GVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLR
        GVKV TLFICSVFA AFSGSSKRLL  NVPDT+RWA AFTELQKNVNM IK  +SSGRFTALRD++AVD  VKKLHSMIQ N+DG MKVEE QNLIVDLR
Subjt:  GVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLR

Query:  REAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQ
         EAEKLTQGVD LTKQVDEFF+IVLSGRDALLSNLR+SET FD+G+  LSTRQ
Subjt:  REAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQ

XP_004133852.1 protein BPS1, chloroplastic [Cucumis sativus]; e value 5.9e-168; identity 87.46%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHNDVKTL+EELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+S  PILD L ESLDLPKVKNSSKGKVLMHVLY VK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRA+E  F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

XP_008438008.1 PREDICTED: protein BPS1, chloroplastic-like [Cucumis melo]; e value 1.8e-169; identity 88.32%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+S +PILD L+ESLDLPKVKNSSKGKVLMH LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRASET F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

XP_022147420.1 protein BPS1, chloroplastic-like isoform X1 [Momordica charantia]; e value 1.1e-161; identity 83.52%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELA+KLL ETHNDVKTLIE+LG PVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L ISVKLLDICNDFSSELSQLNQG L+LRCALHNLESTSSN FV A SSLD W++HISSRTSR ES   ILD LEESLD PKVKNSSKGKVLMHV+YGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFAS FSGSSK+L P  VPDT+RWA AFT+LQKNVN+ IK IYSSGRFT LR+++AVD SV  LHSMIQGN+DG +K EEFQNL+VD RREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLS-TRQL
        EKL+QG+D LTKQVDEFF IVLSGRDALLSNLRASET FD+G+G LS TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLS-TRQL

XP_038877787.1 protein BPS1, chloroplastic-like [Benincasa hispida]; e value 1.1e-177; identity 92.02%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPH PFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELA+ LL ETHNDVKTLIEELGFP SDWDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LN+SVKLLDICNDFSSELSQLNQGHLI+RCALHNLESTSS+ FVHA SSLDAWNQHISSRTSRVES + ILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICS FASA SGSSKRLLPTNV DTFRWAHAFTELQKNVNMEIK IYSSGR TALRDVDAVD SVKKLHSMIQGNMDGCMKVEEFQ LIVDLRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        EKLTQGVD LTKQVD FFHIVLSGRDALLSNLRASET FD+G+   STRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L3G5 Uncharacterized protein; e value 2.9e-168; identity 87.46%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHNDVKTL+EELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+S  PILD L ESLDLPKVKNSSKGKVLMHVLY VK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRA+E  F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

A0A1S3AVG0 protein BPS1, chloroplastic-like; e value 8.9e-170; identity 88.32%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+S +PILD L+ESLDLPKVKNSSKGKVLMH LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRASET F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

A0A5D3D1K6 Protein BPS1; e value 8.9e-170; identity 88.32%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+S +PILD L+ESLDLPKVKNSSKGKVLMH LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRASET F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

A0A6J1D2B3 protein BPS1, chloroplastic-like isoform X1; e value 5.2e-162; identity 83.52%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELA+KLL ETHNDVKTLIE+LG PVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L ISVKLLDICNDFSSELSQLNQG L+LRCALHNLESTSSN FV A SSLD W++HISSRTSR ES   ILD LEESLD PKVKNSSKGKVLMHV+YGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFAS FSGSSK+L P  VPDT+RWA AFT+LQKNVN+ IK IYSSGRFT LR+++AVD SV  LHSMIQGN+DG +K EEFQNL+VD RREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLS-TRQL
        EKL+QG+D LTKQVDEFF IVLSGRDALLSNLRASET FD+G+G LS TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLS-TRQL

A0A6J1E9E6 protein BPS1, chloroplastic-like isoform X2; e value 5.1e-157; identity 82.29%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA++L+KLTPKS++D+ SF WM LA+KLL E HNDVKTLI +L  PVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L+ISVKLLDICNDFSS+LSQLNQGHLILRCALHNL STSSN FV ARSSLDAWNQHISSRTSRVE+ +PI+D  EE LDLPKVKNS KGKVLM V+YGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFA AFSGSSKRLL  NVPDT+RWA AF ELQKNVNM IK  +SSGRFTALRD++AVD  VKKLHSMIQ N+DG MKVEE QNLIVDLR EA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQ
        EKLTQGVD LTKQVDEFF+IVLSGRDALLSNLR+SET FD+G+  LSTRQ
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQ

SwissProt top hits (e value, % identity, alignment)
A2Z9A6 UPF0496 protein 4; e value 3.7e-64; identity 35.87%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+L+ SWM LA+  L E H ++  LI +L  PVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLES----TSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVL
        LN SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S      A  SL  W + +  R  R+ S +  L  L  +L L KVKNS KGKVLM  L
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLES----TSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVL

Query:  YGVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLH-----------------------
        YG++ VT+F+CS+F +  SGS K L+  +VP+ F W+ AF +L   V+ E+    + G   A+++++ V+   K+LH                       
Subjt:  YGVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLH-----------------------

Query:  ------SMIQ-----------------------------GNMDGCMK------------------------------------------------VEEFQ
              S++Q                             G  +  MK                                                 EE  
Subjt:  ------SMIQ-----------------------------GNMDGCMK------------------------------------------------VEEFQ

Query:  NLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASE
        N I  + + AE L  G+D L+K+V +FF IVL+GRDALL NLR S+
Subjt:  NLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASE

Q337C0 UPF0496 protein 4; e value 1.3e-64; identity 35.43%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+L+ SWM LA+  L E H ++  LI +L  PVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLES----TSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVL
        LN SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S      A  SL  W + +  R +R+ S +  L  L  +L L KVKNS+KGKVLM  L
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLES----TSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVL

Query:  YGVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQ-------------------
        YG++ VT+F+CS+F +  SGS K L+  +VP+ F W+ AF +L   V+ E+    S G   A+++++ V+   ++LH +                     
Subjt:  YGVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQ-------------------

Query:  ---------------------------------------GNMDGCMK------------------------------------------------VEEFQ
                                               G  +  MK                                                 EE  
Subjt:  ---------------------------------------GNMDGCMK------------------------------------------------VEEFQ

Query:  NLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASE
        N I  + + AE L  G+D L+K+V +FF IVL+GRDALL NLR S+
Subjt:  NLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASE

Q9LMM6 Protein BPS1, chloroplastic; e value 5.9e-94; identity 53.43%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++DIL+ SWM+ A++ L ETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S      A+SSLD+W QHI S+  R+E+   IL  L ++L+LPKVKNS+KGKVLM  LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA +F E+Q  +N EIKNI+ S   T L++++AV   VKKL+  I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA

Arabidopsis top hits (e value, % identity, alignment)
AT1G01550.1 Protein of unknown function (DUF793); e value 4.2e-95; identity 53.43%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++DIL+ SWM+ A++ L ETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S      A+SSLD+W QHI S+  R+E+   IL  L ++L+LPKVKNS+KGKVLM  LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA +F E+Q  +N EIKNI+ S   T L++++AV   VKKL+  I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA

AT1G01550.2 Protein of unknown function (DUF793); e value 4.2e-95; identity 53.43%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++DIL+ SWM+ A++ L ETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S      A+SSLD+W QHI S+  R+E+   IL  L ++L+LPKVKNS+KGKVLM  LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA +F E+Q  +N EIKNI+ S   T L++++AV   VKKL+  I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA

AT2G46080.1 CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511); e value 2.2e-104; identity 54.65%
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQ+P R FFPFGNPFR +S KGS LS  L+ LL  FE  L ERLKKL PK+++DIL+ SWM+LA++ L ETH ++ TLI +L  PVSDW+EKW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISV+LLD+CN FSSEL++LNQG L L+C LHNL+S S   ++ ARSSLD+W QH+++   R+E+   +LD L +SL LPKVKNS KGKVLM   YGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        V T++ICSVF +A+S S+K L    V +   WA  FT++Q  VN EI+++ SSGR T L+++++VD SV+KL+ MIQ  +D  ++VE F++ +++L  +A
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIG
        EKL+QG+D+L ++VD FF + L GRD LL NLR+S++     +G
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIG

AT3G61500.1 unknown protein; e value 3.5e-25; identity 39.52%
Query:  FRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEK--WLDEYLNISVKLLDICNDFS
        FR + PK +        LL  FE SL ERLKKL P++ ++IL+  WM LA++LLY+THND+  LI +L        E   W + Y+NI+ KLLD+CN F 
Subjt:  FRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEK--WLDEYLNISVKLLDICNDFS

Query:  SELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPK
        S L  +N G + L+   H LE  S +L     S+LD+W ++I++  S+      +L    ESL+  K
Subjt:  SELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLDLPK

AT4G01360.1 unknown protein; e value 1.0e-56; identity 38.6%
Query:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICND
        NPF+ +  K   ++LS +L+ LL  FE +L   +++L PK +NDI+S SWM  A++ L ETH  ++TL+++L  PVSD +E ++  + + S+K  ++CN 
Subjt:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICND

Query:  FSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLD-------LPKVKNSSKGKVLMHVLYGVKVVTLFI
        F+SE+  L  G+L+L+ A   LE+ S N+       L  WNQH+ S+   +E++  +L  L ES+D         K K S++GKVL+ VLYGVKV TL+I
Subjt:  FSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEESLD-------LPKVKNSSKGKVLMHVLYGVKVVTLFI

Query:  CSVFASAFSGSSKRLLPTNVP---DTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREAEKL
         SVF ++FSGSSK L    +P   +   W  AF ELQ  +N EIKN + S  FT ++D++AV+  VKKL++ +Q      + VE  +  +++L    E +
Subjt:  CSVFASAFSGSSKRLLPTNVP---DTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREAEKL

Query:  TQGVDRLTKQVDEFFHIVLSGRDALLSNL
        ++    L+K       +V+S RDALL +L
Subjt:  TQGVDRLTKQVDEFFHIVLSGRDALLSNL


Sequences
CDS sequence
CTCCTCTCATTGCCACCACGCGAAAATTCAACGTTTTCTGGCTGCGATACTTTCTTAGAAATTTCCTTTGCTTTCTTGTTCCTGAATCTGTGTTATACAATGAGC
AGACCACAAGAGCCGCACCGGCCCTTCTTTCCTTTTGGGAATCCTTTCCGTGCAATATCACCAAAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTTTTAGCT
ACTTTTGAGGATTCCTTGGCAGAGAGGCTGAAAAAGCTTACCCCAAAATCGGAGAATGACATACTCAGCTTCTCATGGATGGAATTAGCATTGAAGCTGCTGTAC
GAAACTCACAACGATGTAAAAACCCTTATAGAAGAGCTTGGGTTCCCCGTATCTGATTGGGATGAGAAATGGCTAGATGAGTACCTGAACATCAGTGTGAAATTA
CTTGATATATGCAATGATTTTAGCTCTGAGCTCTCACAGTTGAATCAAGGTCATCTGATACTTCGGTGTGCCTTGCACAATCTGGAATCTACATCTTCCAACCTG
TTTGTTCATGCCCGTTCTTCGCTAGATGCATGGAATCAACATATTAGTTCCAGAACCTCCAGAGTTGAAAGCTTTGCTCCTATTTTGGACTGTCTTGAGGAATCA
CTTGATCTTCCAAAGGTTAAGAACTCATCCAAAGGCAAGGTTTTGATGCATGTGTTGTACGGAGTGAAGGTGGTGACTTTGTTTATTTGCAGTGTTTTTGCTTCT
GCCTTCTCAGGTTCTTCCAAAAGGTTGTTACCCACCAATGTTCCAGATACATTCAGATGGGCACACGCATTTACTGAATTACAGAAAAATGTAAATATGGAAATT
AAGAATATTTATTCTAGTGGAAGGTTTACTGCGTTGAGAGATGTTGATGCAGTTGATGGGAGTGTAAAAAAACTACATTCCATGATTCAAGGTAATATGGATGGC
TGCATGAAAGTGGAAGAATTCCAGAATTTGATTGTAGATTTGAGGAGGGAGGCAGAGAAGCTTACACAAGGTGTTGATCGTCTTACAAAACAAGTTGATGAGTTT
TTTCACATTGTTTTATCTGGACGTGATGCATTGCTTTCAAATCTTAGAGCAAGTGAAACAGAATTTGATCGGGGAATTGGGGTGCTGTCTACAAGGCAACTGTGA
mRNA sequence
CTCCTCTCATTGCCACCACGCGAAAATTCAACGTTTTCTGGCTGCGATACTTTCTTAGAAATTTCCTTTGCTTTCTTGTTCCTGAATCTGTGTTATACAATGAGC
AGACCACAAGAGCCGCACCGGCCCTTCTTTCCTTTTGGGAATCCTTTCCGTGCAATATCACCAAAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTTTTAGCT
ACTTTTGAGGATTCCTTGGCAGAGAGGCTGAAAAAGCTTACCCCAAAATCGGAGAATGACATACTCAGCTTCTCATGGATGGAATTAGCATTGAAGCTGCTGTAC
GAAACTCACAACGATGTAAAAACCCTTATAGAAGAGCTTGGGTTCCCCGTATCTGATTGGGATGAGAAATGGCTAGATGAGTACCTGAACATCAGTGTGAAATTA
CTTGATATATGCAATGATTTTAGCTCTGAGCTCTCACAGTTGAATCAAGGTCATCTGATACTTCGGTGTGCCTTGCACAATCTGGAATCTACATCTTCCAACCTG
TTTGTTCATGCCCGTTCTTCGCTAGATGCATGGAATCAACATATTAGTTCCAGAACCTCCAGAGTTGAAAGCTTTGCTCCTATTTTGGACTGTCTTGAGGAATCA
CTTGATCTTCCAAAGGTTAAGAACTCATCCAAAGGCAAGGTTTTGATGCATGTGTTGTACGGAGTGAAGGTGGTGACTTTGTTTATTTGCAGTGTTTTTGCTTCT
GCCTTCTCAGGTTCTTCCAAAAGGTTGTTACCCACCAATGTTCCAGATACATTCAGATGGGCACACGCATTTACTGAATTACAGAAAAATGTAAATATGGAAATT
AAGAATATTTATTCTAGTGGAAGGTTTACTGCGTTGAGAGATGTTGATGCAGTTGATGGGAGTGTAAAAAAACTACATTCCATGATTCAAGGTAATATGGATGGC
TGCATGAAAGTGGAAGAATTCCAGAATTTGATTGTAGATTTGAGGAGGGAGGCAGAGAAGCTTACACAAGGTGTTGATCGTCTTACAAAACAAGTTGATGAGTTT
TTTCACATTGTTTTATCTGGACGTGATGCATTGCTTTCAAATCTTAGAGCAAGTGAAACAGAATTTGATCGGGGAATTGGGGTGCTGTCTACAAGGCAACTGTGA
Protein sequence
LLSLPPRENSTFSGCDTFLEISFAFLFLNLCYTMSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLY
ETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESFAPILDCLEES
LDLPKVKNSSKGKVLMHVLYGVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDG
CMKVEEFQNLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
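
The correspondence between the listed CDS and the listed protein can be spot-checked with a short sketch. It assumes the standard genetic code and reads codons from the first base of the sequence as printed; `translate` and `CODON_TABLE` are illustrative names, not part of CuGenDB, and unknown codons are rendered as `X` by choice.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; '*' marks a stop, 'X' an unknown codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 21 bases of the CDS sequence as listed on this page.
cds_prefix = "CTCCTCTCATTGCCACCACGC"
print(translate(cds_prefix))  # LLSLPPR
```

The result matches the first seven residues of the protein sequence as listed, which suggests the listed protein was produced by translating the listed sequence from its first base rather than from the first ATG.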