; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013115 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013115
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein BPS1, chloroplastic-like
Genome locationChr01:26985514..26986707
RNA-Seq ExpressionHG10013115
SyntenyHG10013115
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008511 - Protein BYPASS-related


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019020.1 Protein BPS1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]2.3e-16283.15Show/hide
Query:  CSWCYIMSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDE
        CS CYIMS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA++L KLTPKS+N + SF WM LAMKLLCE HNDVKTLI +L  PVSDW E
Subjt:  CSWCYIMSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDE

Query:  KWLDEYLNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMH
        KWLDEYL+ISVKLLDICNDFSS+LSQLNQGHL+LRCALHNL STSS+QFV ARSSLDAWNQHISSRTSRVE+ SPI+D  EE LDLPKVKNS KGKVLM 
Subjt:  KWLDEYLNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMH

Query:  VFYGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIV
        V YG KV TLFICSVFA AFSGSSKRLL  NVPDT+RWA  FTELQKNVNM IK  +SSGRFTALRD++AVDE VKKLHS+IQ N+DG MKVEE QNLIV
Subjt:  VFYGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIV

Query:  DLRREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRK
        DLR EAEKLTQGVDHLTKQVDEFF+IVLSGRD LLSNLR+SETVFDQGM GLS R+
Subjt:  DLRREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRK

XP_004133852.1 protein BPS1, chloroplastic [Cucumis sativus]2.7e-17188.03Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHNDVKTL+EELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+SC PILD L ESLDLPKVKNSSKGKVLMHV Y  K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA+E VF QGMGGL  R+L
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL

XP_008438008.1 PREDICTED: protein BPS1, chloroplastic-like [Cucumis melo]5.3e-17288.89Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+S SPILD L+ESLDLPKVKNSSKGKVLMH  YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVF QGMGGL  R+L
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL

XP_022147420.1 protein BPS1, chloroplastic-like isoform X1 [Momordica charantia]3.4e-16685.59Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLAERLKKLTPKS+N I+SFSWMELAMKLLCETHNDVKTLIE+LG PVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L ISVKLLDICNDFSSELSQLNQG LVLRCALHNLESTSS+QFVQA SSLD W++HISSRTSR ESC  ILD LEESLD PKVKNSSKGKVLMHV YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFAS FSGSSK+L P  VPDT+RWA  FT+LQKNVN+ IKKIYSSGRFT LR+++AVDESV  LHS+IQGN+DG +K EEFQNL+VD RREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLS
        EKL+QG+DHLTKQVDEFF IVLSGRD LLSNLRASETVFDQGMGGLS
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLS

XP_038877787.1 protein BPS1, chloroplastic-like [Benincasa hispida]2.2e-17892.02Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPH PFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSEN ILSFSWMELAM LLCETHNDVKTLIEELGFP SDWDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LN+SVKLLDICNDFSSELSQLNQGHL++RCALHNLESTSS QFV A SSLDAWNQHISSRTSRVES S ILD LEESLDLPKVKNSSKGKVLMHV YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICS FASA SGSSKRLLPTNV DTFRWAH FTELQKNVNMEIKKIYSSGR TALRDVDAVDESVKKLHS+IQGNMDGCMKVEEFQ LIVDLRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL
        EKLTQGVDHLTKQVD FFHIVLSGRD LLSNLRASETVFDQGM G S R+L
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL

TrEMBL top hitse value%identityAlignment
A0A0A0L3G5 Uncharacterized protein1.3e-17188.03Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHNDVKTL+EELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+SC PILD L ESLDLPKVKNSSKGKVLMHV Y  K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA+E VF QGMGGL  R+L
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL

A0A1S3AVG0 protein BPS1, chloroplastic-like2.6e-17288.89Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+S SPILD L+ESLDLPKVKNSSKGKVLMH  YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVF QGMGGL  R+L
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL

A0A5D3D1K6 Protein BPS12.6e-17288.89Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+S SPILD L+ESLDLPKVKNSSKGKVLMH  YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVF QGMGGL  R+L
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL

A0A6J1D2B3 protein BPS1, chloroplastic-like isoform X11.6e-16685.59Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLAERLKKLTPKS+N I+SFSWMELAMKLLCETHNDVKTLIE+LG PVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L ISVKLLDICNDFSSELSQLNQG LVLRCALHNLESTSS+QFVQA SSLD W++HISSRTSR ESC  ILD LEESLD PKVKNSSKGKVLMHV YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFAS FSGSSK+L P  VPDT+RWA  FT+LQKNVN+ IKKIYSSGRFT LR+++AVDESV  LHS+IQGN+DG +K EEFQNL+VD RREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLS
        EKL+QG+DHLTKQVDEFF IVLSGRD LLSNLRASETVFDQGMGGLS
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLS

A0A6J1ISZ1 protein BPS1, chloroplastic-like9.5e-15982.57Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA++L KLTPKS+N + SF WM LAMKLLCE HNDVKTLI +L  PVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L+ISVKLLDICNDFSS+LSQLNQGHL+LRCALHNL STSS+QFV AR SLDAWNQHISSRTSRVE+ SPI+D LEE LDLPKVKNS KGKVLM V YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFA AFSGSSKRLL  +VPDT+RWA  FTELQKNVNM IK  +SSGRFTALR +DAVDE VKKLHS+IQ N+DG MKVEE QNLIVDLR EA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRK
        EKLTQGVDHLTKQVDEFFH+VLSGRD LLSNLR+SETVF QGM GLS R+
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRK

SwissProt top hitse value%identityAlignment
A2Z9A6 UPF0496 protein 48.6e-6442.45Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ + +L+ SWM LA+  L E H ++  LI +L  PVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLES----TSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVF
        LN SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S  Q  +A  SL  W + +  R  R+ SCS  L  L  +L L KVKNS KGKVLM   
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLES----TSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVF

Query:  YGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDL
        YG + VT+F+CS+F +  SGS K L+  +VP+ F W+  F +L   V+ E+ +  + G   A+++++ V+   K+LH +   +                L
Subjt:  YGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDL

Query:  RREAEKLTQGVDHLTKQV
          EA  L   V H  ++V
Subjt:  RREAEKLTQGVDHLTKQV

Q337C0 UPF0496 protein 42.3e-6442.45Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ + +L+ SWM LA+  L E H ++  LI +L  PVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLES----TSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVF
        LN SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S  Q  +A  SL  W + +  R +R+ SCS  L  L  +L L KVKNS+KGKVLM   
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLES----TSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVF

Query:  YGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDL
        YG + VT+F+CS+F +  SGS K L+  +VP+ F W+  F +L   V+ E+ +  S G   A+++++ V+   ++LH +   +                L
Subjt:  YGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDL

Query:  RREAEKLTQGVDHLTKQV
          EA  L   V H  ++V
Subjt:  RREAEKLTQGVDHLTKQV

Q9LMM6 Protein BPS1, chloroplastic4.6e-9453.13Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++ IL+ SWM+ AM+ LCETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S     +A+SSLD+W QHI S+  R+E+C  IL  L ++L+LPKVKNS+KGKVLM   YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA  F E+Q  +N EIK I+ S   T L++++AV   VKKL+ +I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA

Arabidopsis top hitse value%identityAlignment
AT1G01550.1 Protein of unknown function (DUF793)3.3e-9553.13Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++ IL+ SWM+ AM+ LCETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S     +A+SSLD+W QHI S+  R+E+C  IL  L ++L+LPKVKNS+KGKVLM   YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA  F E+Q  +N EIK I+ S   T L++++AV   VKKL+ +I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA

AT1G01550.2 Protein of unknown function (DUF793)3.3e-9553.13Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++ IL+ SWM+ AM+ LCETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S     +A+SSLD+W QHI S+  R+E+C  IL  L ++L+LPKVKNS+KGKVLM   YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA  F E+Q  +N EIK I+ S   T L++++AV   VKKL+ +I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA

AT2G46080.1 CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511)4.9e-10755.52Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQ+P R FFPFGNPFR +S KGS LS  L+ LL  FE  L ERLKKL PK+++ IL+ SWM+LAM+ LCETH ++ TLI +L  PVSDW+EKW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISV+LLD+CN FSSEL++LNQG L L+C LHNL+S S  +++QARSSLD+W QH+++   R+E+C  +LD L +SL LPKVKNS KGKVLM  FYG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        V T++ICSVF +A+S S+K L    V +   WA VFT++Q  VN EI+ + SSGR T L+++++VD SV+KL+ +IQ  +D  ++VE F++ +++L  +A
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMG
        EKL+QG+D L ++VD FF + L GRD LL NLR+S+++    +G
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMG

AT3G61500.1 unknown protein9.5e-2640.12Show/hide
Query:  FRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEK--WLDEYLNISVKLLDICNDFS
        FR + PK +        LL  FE SL ERLKKL P++ + IL+  WM LAM+LL +THND+  LI +L        E   W + Y+NI+ KLLD+CN F 
Subjt:  FRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEK--WLDEYLNISVKLLDICNDFS

Query:  SELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPK
        S L  +N G + L+   H LE  S     +  S+LD+W ++I   T+ +  C  +L R  ESL+  K
Subjt:  SELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPK

AT4G01360.1 unknown protein3.7e-5437.39Show/hide
Query:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICND
        NPF+ +  K   ++LS +L+ LL  FE +L   +++L PK +N I+S SWM  AM+ LCETH  ++TL+++L  PVSD +E ++  + + S+K  ++CN 
Subjt:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICND

Query:  FSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLD-------LPKVKNSSKGKVLMHVFYGAKVVTLFI
        F+SE+  L  G+L+L+ A   LE+ S        + L  WNQH+ S+   +E+   +L RL ES+D         K K S++GKVL+ V YG KV TL+I
Subjt:  FSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLD-------LPKVKNSSKGKVLMHVFYGAKVVTLFI

Query:  CSVFASAFSGSSKRLLPTNVP---DTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREAEKL
         SVF ++FSGSSK L    +P   +   W   F ELQ  +N EIK  + S  FT ++D++AV+  VKKL++ +Q      + VE  +  +++L    E +
Subjt:  CSVFASAFSGSSKRLLPTNVP---DTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREAEKL

Query:  TQGVDHLTKQVDEFFHIVLSGRDELLSNL
        ++    L+K       +V+S RD LL +L
Subjt:  TQGVDHLTKQVDEFFHIVLSGRDELLSNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACCAATTGTATCTGAAATTTTATTTCATGATTGATTTTCCATGTGATGGTGGCATGATGGATGGTTATCATGAACTTGTTTCTAAATGGCAGCTCATTCTTGGATT
TTATCCATCTTGCAGTTGGTGTTATATAATGAGCAGACCACAAGAGCCGCACCGGCCCTTCTTTCCTTTTGGGAATCCTTTCCGCGCAATATCACCAAAGGGTTCAAAAT
TGTCTTCTAGACTTGTTTTTCTTTTAGCTACTTTTGAGGATTCTTTGGCAGAGAGGCTGAAAAAGCTTACTCCAAAATCGGAGAATCACATACTCAGCTTCTCATGGATG
GAATTAGCAATGAAGCTGCTGTGTGAAACCCACAATGATGTAAAAACCCTTATAGAAGAGCTTGGGTTCCCTGTGTCTGACTGGGATGAGAAATGGCTAGATGAGTACCT
GAACATCAGTGTGAAATTACTTGATATATGCAATGATTTTAGCTCTGAGCTCTCCCAGTTGAATCAAGGTCATCTGGTACTTCGGTGTGCCTTGCACAATCTGGAATCTA
CATCTTCCAGCCAGTTTGTTCAGGCCCGTTCTTCGCTAGATGCATGGAATCAACATATTAGTTCCAGAACCTCCAGAGTTGAGAGCTGTTCTCCTATTTTGGACCGTCTT
GAGGAATCACTTGATCTTCCAAAAGTTAAGAACTCATCCAAAGGCAAGGTTTTGATGCATGTGTTTTATGGAGCGAAGGTGGTGACTTTGTTTATTTGCAGTGTTTTTGC
TTCTGCCTTCTCAGGTTCTTCCAAAAGGTTGTTACCCACCAATGTTCCAGATACATTCAGATGGGCGCACGTGTTTACTGAATTACAGAAAAATGTAAATATGGAAATTA
AAAAAATTTATTCTAGTGGAAGGTTTACTGCGTTGAGAGATGTTGATGCAGTTGATGAGAGTGTAAAAAAACTGCATTCCATAATTCAAGGAAATATGGATGGCTGCATG
AAAGTGGAAGAATTCCAGAATTTGATTGTAGATTTGAGGAGGGAAGCAGAGAAGCTTACACAAGGTGTTGATCATCTTACAAAACAAGTTGATGAGTTTTTTCACATTGT
TTTATCTGGACGGGATGAATTGCTTTCAAATCTTAGAGCAAGTGAAACAGTATTTGATCAGGGTATGGGAGGGCTGTCTCCAAGGAAACTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACCAATTGTATCTGAAATTTTATTTCATGATTGATTTTCCATGTGATGGTGGCATGATGGATGGTTATCATGAACTTGTTTCTAAATGGCAGCTCATTCTTGGATT
TTATCCATCTTGCAGTTGGTGTTATATAATGAGCAGACCACAAGAGCCGCACCGGCCCTTCTTTCCTTTTGGGAATCCTTTCCGCGCAATATCACCAAAGGGTTCAAAAT
TGTCTTCTAGACTTGTTTTTCTTTTAGCTACTTTTGAGGATTCTTTGGCAGAGAGGCTGAAAAAGCTTACTCCAAAATCGGAGAATCACATACTCAGCTTCTCATGGATG
GAATTAGCAATGAAGCTGCTGTGTGAAACCCACAATGATGTAAAAACCCTTATAGAAGAGCTTGGGTTCCCTGTGTCTGACTGGGATGAGAAATGGCTAGATGAGTACCT
GAACATCAGTGTGAAATTACTTGATATATGCAATGATTTTAGCTCTGAGCTCTCCCAGTTGAATCAAGGTCATCTGGTACTTCGGTGTGCCTTGCACAATCTGGAATCTA
CATCTTCCAGCCAGTTTGTTCAGGCCCGTTCTTCGCTAGATGCATGGAATCAACATATTAGTTCCAGAACCTCCAGAGTTGAGAGCTGTTCTCCTATTTTGGACCGTCTT
GAGGAATCACTTGATCTTCCAAAAGTTAAGAACTCATCCAAAGGCAAGGTTTTGATGCATGTGTTTTATGGAGCGAAGGTGGTGACTTTGTTTATTTGCAGTGTTTTTGC
TTCTGCCTTCTCAGGTTCTTCCAAAAGGTTGTTACCCACCAATGTTCCAGATACATTCAGATGGGCGCACGTGTTTACTGAATTACAGAAAAATGTAAATATGGAAATTA
AAAAAATTTATTCTAGTGGAAGGTTTACTGCGTTGAGAGATGTTGATGCAGTTGATGAGAGTGTAAAAAAACTGCATTCCATAATTCAAGGAAATATGGATGGCTGCATG
AAAGTGGAAGAATTCCAGAATTTGATTGTAGATTTGAGGAGGGAAGCAGAGAAGCTTACACAAGGTGTTGATCATCTTACAAAACAAGTTGATGAGTTTTTTCACATTGT
TTTATCTGGACGGGATGAATTGCTTTCAAATCTTAGAGCAAGTGAAACAGTATTTGATCAGGGTATGGGAGGGCTGTCTCCAAGGAAACTTTGA
Protein sequenceShow/hide protein sequence
MHQLYLKFYFMIDFPCDGGMMDGYHELVSKWQLILGFYPSCSWCYIMSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWM
ELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRL
EESLDLPKVKNSSKGKVLMHVFYGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCM
KVEEFQNLIVDLRREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQGMGGLSPRKL