CuGenDBv2

Lag0020728 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0020728
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: protein BPS1, chloroplastic-like
Genome location: chr7:1673440..1674474
RNA-Seq Expression: Lag0020728
Synteny: Lag0020728
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR008511 - Protein BYPASS-related
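
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string and computes the span it covers. The function name `parse_location` is hypothetical, and the span calculation assumes 1-based coordinates that are inclusive at both ends, which this page does not state explicitly.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr7:1673440..1674474")
# Assumption: coordinates are 1-based and inclusive at both ends.
span = end - start + 1
print(chrom, span)  # prints: chr7 1035
```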


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7019020.1 Protein BPS1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value 7.9e-156, identity 82.08%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA +L KLTPKS+ND+ SF WM LAMK+LCE HNDVK LI +L LPVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        LD+SVKLLDICNDFSS+LSQLNQGHL+LRCALHNLAS+SSNQF+ ARSSLD WN+HISSRTSRVE+ SP +D  EE LDLPKVK+SPKGK LM VMYGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        V TLFICSVFA AFSGSSKRLL I+VPDT+RWA AFTELQKNVNM IK N+SSGRFT LRDL AVDE V+KLHSMI  N+D GMKVEE QNL+VDLR EA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL
        EKLTQGVD LTKQVDEFF+IVLSGRDALL++LR+SETVFD GM GL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL

XP_022147420.1 protein BPS1, chloroplastic-like isoform X1 [Momordica charantia] (e-value 2.0e-159, identity 83.82%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLA RLKKLTPKS+NDI+SFSWMELAMK+LCETHNDVK LIE+LGLPVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        L +SVKLLDICNDFSSELSQLNQG LVLRCALHNL S+SSNQF+ A SSLDGW++HISSRTSR ESC   LD LEESLD PKVK+S KGK LMHVMYGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        VETLFICSVFAS FSGSSK+L PI+VPDT+RWA AFT+LQKNVN+ IKK YSSGRFTVLR+L AVDESV  LHSMI  N+D G+K EE QNLVVD RREA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL
        EKL+QG+D LTKQVDEFF IVLSGRDALL++LRASETVFD GMGGL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL

XP_022924524.1 protein BPS1, chloroplastic-like isoform X2 [Cucurbita moschata] (e-value 4.4e-154, identity 81.21%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA +L+KLTPKS++D+ SF WM LAMK+LCE HNDVK LI +L LPVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        LD+SVKLLDICNDFSS+LSQLNQGHL+LRCALHNLAS+SSNQF+ ARSSLD WN+HISSRTSRVE+ SP +D  EE LDLPKVK+SPKGK LM VMYGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        V TLFICSVFA AFSGSSKRLL I+VPDT+RWA AF ELQKNVNM IK N+SSGRFT LRDL AVDE V+KLHSMI  N+D GMKVEE QNL+VDLR EA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL
        EKLTQGVD LTKQVDEFF+IVLSGRDALL++LR+SETVFD GM  L
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL

XP_023526707.1 protein BPS1, chloroplastic-like [Cucurbita pepo subsp. pepo] (e-value 3.0e-155, identity 82.32%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA +L KLTPKS+ND+ SF WM LAMK+LCE HNDVK LI +L LPVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        LDVSVKLLDICNDFSS+LSQLNQGHL+LRCALHNLAS+SSNQF+ ARSSLD WN+HISSRTSRVE+ SP +D LEE LDLPKVK+SPKGK LM VMYGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        V TLFICSVFA AFSGSSKRLL I+VPDT+RWA AFTELQKNVNM IK N+SSGRFT LRDL AVDE V+KLHSMI  N+D GMKVEE QNL+VDLR EA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGG
        EKLTQGVD LTKQVDEFF+IVLSGRDALL++LR+SE VFD GM G
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGG

XP_038877787.1 protein BPS1, chloroplastic-like [Benincasa hispida] (e-value 9.1e-160, identity 85.51%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MSRPQEPH PFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLA RLKKLTPKSENDILSFSWMELAM +LCETHNDVK LIEELG P SDWDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        L+VSVKLLDICNDFSSELSQLNQGHL++RCALHNL S+SS+QF+HA SSLD WN+HISSRTSRVES S  LD LEESLDLPKVK+S KGK LMHV+YGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        V TLFICS FASA SGSSKRLLP +V DTFRWAHAFTELQKNVNMEIKK YSSGR T LRD+ AVDESV+KLHSMI  NMD  MKVEE Q L+VDLRREA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGG
        EKLTQGVD LTKQVD FFHIVLSGRDALL++LRASETVFD GM G
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGG

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3AVG0 protein BPS1, chloroplastic-like (e-value 1.0e-153, identity 81.79%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLA RLKKLTPKSENDILSFSWMELAMK+L ETHN+VK LIEELG PVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        L++SVKLLDICNDFSSELSQLNQGHL+LRCALHNL S+SSNQ + A SSLD WN+HISSRTSRV+S SP LD L+ESLDLPKVK+S KGK LMH +YGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        V TLFICSVFAS+FSGSS+ LLP +VPD+FRWA AFTELQK VNMEIKK YSSGRFT LRD+ AV+E V+KLHSMI  NMDD    EE QNL+V+LRREA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL
        E LTQGVD LTKQVDEFFHIVLSGRD LL++LRASETVF  GMGGL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL

A0A5D3D1K6 Protein BPS1 (e-value 1.0e-153, identity 81.79%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLA RLKKLTPKSENDILSFSWMELAMK+L ETHN+VK LIEELG PVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        L++SVKLLDICNDFSSELSQLNQGHL+LRCALHNL S+SSNQ + A SSLD WN+HISSRTSRV+S SP LD L+ESLDLPKVK+S KGK LMH +YGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        V TLFICSVFAS+FSGSS+ LLP +VPD+FRWA AFTELQK VNMEIKK YSSGRFT LRD+ AV+E V+KLHSMI  NMDD    EE QNL+V+LRREA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL
        E LTQGVD LTKQVDEFFHIVLSGRD LL++LRASETVF  GMGGL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL

A0A6J1D2B3 protein BPS1, chloroplastic-like isoform X1 (e-value 9.8e-160, identity 83.82%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLA RLKKLTPKS+NDI+SFSWMELAMK+LCETHNDVK LIE+LGLPVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        L +SVKLLDICNDFSSELSQLNQG LVLRCALHNL S+SSNQF+ A SSLDGW++HISSRTSR ESC   LD LEESLD PKVK+S KGK LMHVMYGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        VETLFICSVFAS FSGSSK+L PI+VPDT+RWA AFT+LQKNVN+ IKK YSSGRFTVLR+L AVDESV  LHSMI  N+D G+K EE QNLVVD RREA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL
        EKL+QG+D LTKQVDEFF IVLSGRDALL++LRASETVFD GMGGL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL

A0A6J1E9E6 protein BPS1, chloroplastic-like isoform X2 (e-value 2.1e-154, identity 81.21%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA +L+KLTPKS++D+ SF WM LAMK+LCE HNDVK LI +L LPVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        LD+SVKLLDICNDFSS+LSQLNQGHL+LRCALHNLAS+SSNQF+ ARSSLD WN+HISSRTSRVE+ SP +D  EE LDLPKVK+SPKGK LM VMYGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        V TLFICSVFA AFSGSSKRLL I+VPDT+RWA AF ELQKNVNM IK N+SSGRFT LRDL AVDE V+KLHSMI  N+D GMKVEE QNL+VDLR EA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL
        EKLTQGVD LTKQVDEFF+IVLSGRDALL++LR+SETVFD GM  L
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL

A0A6J1ISZ1 protein BPS1, chloroplastic-like (e-value 2.8e-154, identity 81.5%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA +L KLTPKS+ND+ SF WM LAMK+LCE HNDVK LI +L LPVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        LD+SVKLLDICNDFSS+LSQLNQGHL+LRCALHNLAS+SSNQF+ AR SLD WN+HISSRTSRVE+ SP +D LEE LDLPKVK+SPKGK LM VMYGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA
        V TLFICSVFA AFSGSSKRLL I VPDT+RWA AFTELQKNVNM IK N+SSGRFT LR L AVDE V+KLHSMI  N+D GMKVEE QNL+VDLR EA
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMI--NMDDGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL
        EKLTQGVD LTKQVDEFFH+VLSGRDALL++LR+SETVF  GM GL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWGMGGL

SwissProt top hits (e-value, % identity, alignment)
A2Z9A6 UPF0496 protein 4 (e-value 6.3e-63, identity 40.12%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+L+ SWM LA+  L E H ++  LI +L LPVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASS----SSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVM
        L+ SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S  Q   A  SL  W   +  R  R+ SCS TL  L  +L L KVK+S KGK LM  +
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASS----SSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVM

Query:  YGVKVETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMDDGMKVEEIQNLVVDLRR
        YG++  T+F+CS+F +  SGS K L+ + VP+ F W+ AF +L   V+ E+ +  + G    +++L  V+   ++LH + +       EE  NL   +  
Subjt:  YGVKVETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMDDGMKVEEIQNLVVDLRR

Query:  EAEKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWG
          E++    D + ++ D    + L+        +  SE++ + G
Subjt:  EAEKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWG

A2Z9A6 UPF0496 protein 4 (e-value 4.1e-06, identity 40.28%)
Query:  GAVDESVRKLHSMINMDDGMKVEEIQNLVVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASE
        G+ DES   +    ++ +    EE+ N +  + + AE L  G+D L+K+V +FF IVL+GRDALL +LR S+
Subjt:  GAVDESVRKLHSMINMDDGMKVEEIQNLVVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASE

Q337C0 UPF0496 protein 4 (e-value 4.3e-64, identity 40.7%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+L+ SWM LA+  L E H ++  LI +L LPVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASS----SSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVM
        L+ SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S  Q   A  SL  W   +  R +R+ SCS TL  L  +L L KVK+S KGK LM  +
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASS----SSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVM

Query:  YGVKVETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMDDGMKVEEIQNLVVDLRR
        YG++  T+F+CS+F +  SGS K L+ + VP+ F W+ AF +L   V+ E+ +  S G    +++L  V+   R+LH + +       EE  NL   +  
Subjt:  YGVKVETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMDDGMKVEEIQNLVVDLRR

Query:  EAEKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWG
          E++    D + ++ D    + L+        +  SE++ + G
Subjt:  EAEKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETVFDWG

Q337C0 UPF0496 protein 4 (e-value 4.1e-06, identity 40.28%)
Query:  GAVDESVRKLHSMINMDDGMKVEEIQNLVVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASE
        G+ DES   +    ++ +    EE+ N +  + + AE L  G+D L+K+V +FF IVL+GRDALL +LR S+
Subjt:  GAVDESVRKLHSMINMDDGMKVEEIQNLVVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASE

Q9LMM6 Protein BPS1, chloroplastic (e-value 1.9e-91, identity 51.5%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA+ + KL PK ++DIL+ SWM+ AM+ LCETHN +K LI +L LPVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        LD+SVKLLD+CN FSSEL++LNQGHL+L+ ALHNL ++S      A+SSLD W +HI S+  R+E+C   L  L ++L+LPKVK+S KGK LM  +YGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMD--DGMKVEEIQNLVVDLRREA
        V+TL+I  VFA+AFSGSS+ L+ ++V +   WA +F E+Q  +N EIK  + S   TVL++L AV   V+KL+  I     D + ++ +++ V +     
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMD--DGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRA
          L+ G+D ++K+VD FF I+LSGRD LL +LR+
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRA

Arabidopsis top hits (e-value, % identity, alignment)
AT1G01550.1 Protein of unknown function (DUF793) (e-value 1.3e-92, identity 51.5%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA+ + KL PK ++DIL+ SWM+ AM+ LCETHN +K LI +L LPVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        LD+SVKLLD+CN FSSEL++LNQGHL+L+ ALHNL ++S      A+SSLD W +HI S+  R+E+C   L  L ++L+LPKVK+S KGK LM  +YGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMD--DGMKVEEIQNLVVDLRREA
        V+TL+I  VFA+AFSGSS+ L+ ++V +   WA +F E+Q  +N EIK  + S   TVL++L AV   V+KL+  I     D + ++ +++ V +     
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMD--DGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRA
          L+ G+D ++K+VD FF I+LSGRD LL +LR+
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRA

AT1G01550.2 Protein of unknown function (DUF793) (e-value 1.3e-92, identity 51.5%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA+ + KL PK ++DIL+ SWM+ AM+ LCETHN +K LI +L LPVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        LD+SVKLLD+CN FSSEL++LNQGHL+L+ ALHNL ++S      A+SSLD W +HI S+  R+E+C   L  L ++L+LPKVK+S KGK LM  +YGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMD--DGMKVEEIQNLVVDLRREA
        V+TL+I  VFA+AFSGSS+ L+ ++V +   WA +F E+Q  +N EIK  + S   TVL++L AV   V+KL+  I     D + ++ +++ V +     
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMD--DGMKVEEIQNLVVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRA
          L+ G+D ++K+VD FF I+LSGRD LL +LR+
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRA

AT2G46080.1 CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511) (e-value 4.1e-102, identity 54.6%)
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY
        MSRPQ+P R FFPFGNPFR +S KGS LS  L+ LL  FE  L  RLKKL PK+++DIL+ SWM+LAM+ LCETH ++  LI +L LPVSDW+EKW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEY

Query:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK
        L++SV+LLD+CN FSSEL++LNQG L L+C LHNL S S  +++ ARSSLD W +H+++   R+E+C   LD L +SL LPKVK+SPKGK LM   YGVK
Subjt:  LDVSVKLLDICNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVK

Query:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMD-DGMKVEEIQNLVVDLRREAE
        V+T++ICSVF +A+S S+K L  + V +   WA  FT++Q  VN EI+   SSGR T+L++L +VD SV KL+ MI    D ++VE  ++ V++L  +AE
Subjt:  VETLFICSVFASAFSGSSKRLLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMD-DGMKVEEIQNLVVDLRREAE

Query:  KLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETV
        KL+QG+D+L ++VD FF + L GRD LL +LR+S+++
Subjt:  KLTQGVDRLTKQVDEFFHIVLSGRDALLTSLRASETV

AT3G61500.1 unknown protein (e-value 5.0e-23, identity 37.72%)
Query:  FRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEK--WLDEYLDVSVKLLDICNDFS
        FR + PK +        LL  FE SL  RLKKL P++ ++IL+  WM LAM++L +THND+  LI +L L      E   W + Y++++ KLLD+CN F 
Subjt:  FRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEK--WLDEYLDVSVKLLDICNDFS

Query:  SELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPK
        S L  +N G + L+   H L   S +      S+LD W  +I   T+ +  C   L R  ESL+  K
Subjt:  SELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPK

AT4G01360.1 unknown protein (e-value 1.9e-54, identity 38.6%)
Query:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEYLDVSVKLLDICND
        NPF+ +  K   ++LS +L+ LL  FE +L   +++L PK +NDI+S SWM  AM+ LCETH  ++ L+++L +PVSD +E ++  + D S+K  ++CN 
Subjt:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEYLDVSVKLLDICND

Query:  FSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLD-------LPKVKDSPKGKFLMHVMYGVKVETLFI
        F+SE+  L  G+L+L+ A   L ++S N        L  WN+H+ S+   +E+    L RL ES+D         K K S +GK L+ V+YGVKV+TL+I
Subjt:  FSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLD-------LPKVKDSPKGKFLMHVMYGVKVETLFI

Query:  CSVFASAFSGSSKRLLPISVP---DTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMDDG--MKVEEIQNLVVDLRREAEKL
         SVF ++FSGSSK L  +++P   +   W  AF ELQ  +N EIK  + S  FTV++DL AV+  V+KL++ +       + VE ++  V++L    E +
Subjt:  CSVFASAFSGSSKRLLPISVP---DTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMDDG--MKVEEIQNLVVDLRREAEKL

Query:  TQGVDRLTKQVDEFFHIVLSGRDALLTSL
        ++    L+K       +V+S RDALL SL
Subjt:  TQGVDRLTKQVDEFFHIVLSGRDALLTSL
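
In the alignments above, the line between each Query and Subjt row is a match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch of how such a line can be tallied, the snippet below counts identities and positives for one short fragment taken from the start of the first GenBank alignment. The function name `midline_stats` is hypothetical, the approach requires the match line's whitespace to be preserved exactly, and it is not claimed to reproduce the % identity values reported on this page, which are computed over the full alignment.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, length) for one alignment row.

    Assumes BLAST-style match-line conventions: a letter equal to the
    query residue is an identity, '+' is a conservative substitution.
    """
    if len(query) != len(midline):
        raise ValueError("query and match line must have the same length")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# First 18 columns of the first GenBank alignment on this page.
query_fragment = "MSRPQEPHRPFFPFGNPF"
match_fragment = "MS+PQEPHRPFF FGNPF"

identities, positives, length = midline_stats(query_fragment, match_fragment)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

For this fragment the tally is 16 identities and 17 positives over 18 columns.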


Sequences
CDS sequence
ATGAGCCGACCACAAGAGCCGCACCGGCCTTTCTTTCCTTTTGGGAATCCTTTCCGTGCAATATCACCGAAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTGTTAGC
CACTTTTGAGGATTCCTTGGCAGCGAGGCTGAAAAAGCTCACCCCAAAATCAGAGAATGACATACTTAGCTTCTCATGGATGGAATTAGCAATGAAGATGCTGTGTGAAA
CTCACAATGATGTAAAAGCCCTTATAGAAGAGCTTGGGCTCCCTGTGTCTGACTGGGATGAGAAATGGCTAGATGAATACCTGGACGTCAGTGTGAAATTACTTGATATA
TGCAATGATTTTAGCTCTGAGCTCTCACAGTTGAATCAGGGCCATCTGGTACTTCGGTGTGCCTTGCACAATCTGGCATCTTCATCTTCGAACCAGTTCATTCATGCCCG
TTCTTCGTTAGATGGATGGAATCGACATATTAGTTCCAGAACCTCCAGAGTAGAGAGCTGTTCTCCTACTTTGGACCGTCTTGAGGAATCACTTGATCTTCCAAAGGTTA
AGGACTCACCTAAAGGCAAGTTTTTGATGCATGTGATGTACGGAGTGAAGGTGGAGACTCTGTTTATTTGCAGTGTTTTTGCTTCTGCCTTCTCAGGTTCTTCCAAAAGG
TTGTTACCTATCAGTGTTCCAGATACATTTAGATGGGCACACGCTTTTACTGAATTACAGAAGAATGTAAATATGGAAATTAAAAAAAATTATTCTAGTGGAAGATTTAC
TGTATTAAGAGATCTTGGTGCAGTTGATGAGAGTGTAAGAAAATTGCATTCCATGATAAACATGGACGATGGCATGAAAGTAGAAGAAATCCAGAATTTGGTTGTAGATT
TGAGAAGAGAGGCAGAAAAGCTTACACAAGGTGTTGATCGTCTTACAAAACAAGTTGATGAGTTTTTTCACATTGTTTTATCTGGACGCGATGCATTGCTTACAAGTCTT
AGAGCAAGTGAGACGGTATTTGATTGGGGAATGGGGGGTCTATAA
mRNA sequence
ATGAGCCGACCACAAGAGCCGCACCGGCCTTTCTTTCCTTTTGGGAATCCTTTCCGTGCAATATCACCGAAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTGTTAGC
CACTTTTGAGGATTCCTTGGCAGCGAGGCTGAAAAAGCTCACCCCAAAATCAGAGAATGACATACTTAGCTTCTCATGGATGGAATTAGCAATGAAGATGCTGTGTGAAA
CTCACAATGATGTAAAAGCCCTTATAGAAGAGCTTGGGCTCCCTGTGTCTGACTGGGATGAGAAATGGCTAGATGAATACCTGGACGTCAGTGTGAAATTACTTGATATA
TGCAATGATTTTAGCTCTGAGCTCTCACAGTTGAATCAGGGCCATCTGGTACTTCGGTGTGCCTTGCACAATCTGGCATCTTCATCTTCGAACCAGTTCATTCATGCCCG
TTCTTCGTTAGATGGATGGAATCGACATATTAGTTCCAGAACCTCCAGAGTAGAGAGCTGTTCTCCTACTTTGGACCGTCTTGAGGAATCACTTGATCTTCCAAAGGTTA
AGGACTCACCTAAAGGCAAGTTTTTGATGCATGTGATGTACGGAGTGAAGGTGGAGACTCTGTTTATTTGCAGTGTTTTTGCTTCTGCCTTCTCAGGTTCTTCCAAAAGG
TTGTTACCTATCAGTGTTCCAGATACATTTAGATGGGCACACGCTTTTACTGAATTACAGAAGAATGTAAATATGGAAATTAAAAAAAATTATTCTAGTGGAAGATTTAC
TGTATTAAGAGATCTTGGTGCAGTTGATGAGAGTGTAAGAAAATTGCATTCCATGATAAACATGGACGATGGCATGAAAGTAGAAGAAATCCAGAATTTGGTTGTAGATT
TGAGAAGAGAGGCAGAAAAGCTTACACAAGGTGTTGATCGTCTTACAAAACAAGTTGATGAGTTTTTTCACATTGTTTTATCTGGACGCGATGCATTGCTTACAAGTCTT
AGAGCAAGTGAGACGGTATTTGATTGGGGAATGGGGGGTCTATAA
Protein sequence
MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAARLKKLTPKSENDILSFSWMELAMKMLCETHNDVKALIEELGLPVSDWDEKWLDEYLDVSVKLLDI
CNDFSSELSQLNQGHLVLRCALHNLASSSSNQFIHARSSLDGWNRHISSRTSRVESCSPTLDRLEESLDLPKVKDSPKGKFLMHVMYGVKVETLFICSVFASAFSGSSKR
LLPISVPDTFRWAHAFTELQKNVNMEIKKNYSSGRFTVLRDLGAVDESVRKLHSMINMDDGMKVEEIQNLVVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLTSL
RASETVFDWGMGGL
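
The CDS and protein sequences above can be cross-checked by translating one into the other. The sketch below does this for the final line of the CDS listing, which appears to start on a codon boundary. It assumes the standard genetic code (NCBI translation table 1), which this page does not state; `translate` and the other names are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# Final line of the CDS listing on this page.
cds_tail = "AGAGCAAGTGAGACGGTATTTGATTGGGGAATGGGGGGTCTATAA"
print(translate(cds_tail))  # prints: RASETVFDWGMGGL
```

The result matches the final line of the protein sequence listing.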