; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G020580 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G020580
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionprotein BPS1, chloroplastic-like
Genome locationchr05:27505188..27506955
RNA-Seq ExpressionLsi05G020580
SyntenyLsi05G020580
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008511 - Protein BYPASS-related


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019020.1 Protein BPS1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]8.0e-16083.19Show/hide
Query:  CSWCYIMSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDE
        CS CYIMS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA++L KLTPKS+N + SF WM LAMKLLCE HNDVKTLI +L  PVSDW E
Subjt:  CSWCYIMSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDE

Query:  KWLDEYLNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMH
        KWLDEYL+ISVKLLDICNDFSS+LSQLNQGHL+LRCALHNL STSS+QFV ARSSLDAWNQHISSRTSRVE+ SPI+D  EE LDLPKVKNS KGKVLM 
Subjt:  KWLDEYLNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMH

Query:  VFYGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIV
        V YG KV TLFICSVFA AFSGSSKRLL  NVPDT+RWA  FTELQKNVNM IK  +SSGRFTALRD++AVDE VKKLHS+IQ N+DG MKVEE QNLIV
Subjt:  VFYGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIV

Query:  DLRREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEG
        DLR EAEKLTQGVDHLTKQVDEFF+IVLSGRD LLSNLR+SETVFDQ  EG
Subjt:  DLRREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEG

XP_004133852.1 protein BPS1, chloroplastic [Cucumis sativus]5.2e-16788.56Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHNDVKTL+EELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+SC PILD L ESLDLPKVKNSSKGKVLMHV Y  K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA+E VF Q
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ

XP_008438008.1 PREDICTED: protein BPS1, chloroplastic-like [Cucumis melo]1.0e-16789.44Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+S SPILD L+ESLDLPKVKNSSKGKVLMH  YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVF Q
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ

XP_022147420.1 protein BPS1, chloroplastic-like isoform X1 [Momordica charantia]1.7e-16285.34Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLAERLKKLTPKS+N I+SFSWMELAMKLLCETHNDVKTLIE+LG PVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L ISVKLLDICNDFSSELSQLNQG LVLRCALHNLESTSS+QFVQA SSLD W++HISSRTSR ESC  ILD LEESLD PKVKNSSKGKVLMHV YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFAS FSGSSK+L P  VPDT+RWA  FT+LQKNVN+ IKKIYSSGRFT LR+++AVDESV  LHS+IQGN+DG +K EEFQNL+VD RREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ
        EKL+QG+DHLTKQVDEFF IVLSGRD LLSNLRASETVFDQ
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ

XP_038877787.1 protein BPS1, chloroplastic-like [Benincasa hispida]6.1e-17692.46Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPH PFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSEN ILSFSWMELAM LLCETHNDVKTLIEELGFP SDWDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LN+SVKLLDICNDFSSELSQLNQGHL++RCALHNLESTSS QFV A SSLDAWNQHISSRTSRVES S ILD LEESLDLPKVKNSSKGKVLMHV YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICS FASA SGSSKRLLPTNV DTFRWAH FTELQKNVNMEIKKIYSSGR TALRDVDAVDESVKKLHS+IQGNMDGCMKVEEFQ LIVDLRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEG
        EKLTQGVDHLTKQVD FFHIVLSGRD LLSNLRASETVFDQ  EG
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEG

TrEMBL top hitse value%identityAlignment
A0A0A0L3G5 Uncharacterized protein2.5e-16788.56Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHNDVKTL+EELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+SC PILD L ESLDLPKVKNSSKGKVLMHV Y  K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRA+E VF Q
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ

A0A1S3AVG0 protein BPS1, chloroplastic-like5.0e-16889.44Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+S SPILD L+ESLDLPKVKNSSKGKVLMH  YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVF Q
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ

A0A5D3D1K6 Protein BPS15.0e-16889.44Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSEN ILSFSWMELAMKLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISVKLLDICNDFSSELSQLNQGHL+LRCALHNLESTSS+Q V+A SSLDAWNQHISSRTSRV+S SPILD L+ESLDLPKVKNSSKGKVLMH  YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA  FTELQK VNMEIKKIYSSGRFTALRDVDAV+E VKKLHS+IQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ
        E LTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVF Q
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ

A0A6J1D2B3 protein BPS1, chloroplastic-like isoform X18.3e-16385.34Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLAERLKKLTPKS+N I+SFSWMELAMKLLCETHNDVKTLIE+LG PVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L ISVKLLDICNDFSSELSQLNQG LVLRCALHNLESTSS+QFVQA SSLD W++HISSRTSR ESC  ILD LEESLD PKVKNSSKGKVLMHV YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFAS FSGSSK+L P  VPDT+RWA  FT+LQKNVN+ IKKIYSSGRFT LR+++AVDESV  LHS+IQGN+DG +K EEFQNL+VD RREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ
        EKL+QG+DHLTKQVDEFF IVLSGRD LLSNLRASETVFDQ
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQ

A0A6J1ISZ1 protein BPS1, chloroplastic-like4.4e-15682.61Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA++L KLTPKS+N + SF WM LAMKLLCE HNDVKTLI +L  PVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L+ISVKLLDICNDFSS+LSQLNQGHL+LRCALHNL STSS+QFV AR SLDAWNQHISSRTSRVE+ SPI+D LEE LDLPKVKNS KGKVLM V YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFA AFSGSSKRLL  +VPDT+RWA  FTELQKNVNM IK  +SSGRFTALR +DAVDE VKKLHS+IQ N+DG MKVEE QNLIVDLR EA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEG
        EKLTQGVDHLTKQVDEFFH+VLSGRD LLSNLR+SETVF Q  EG
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEG

SwissProt top hitse value%identityAlignment
A2Z9A6 UPF0496 protein 46.5e-6439.5Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ + +L+ SWM LA+  L E H ++  LI +L  PVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLES----TSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVF
        LN SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S  Q  +A  SL  W + +  R  R+ SCS  L  L  +L L KVKNS KGKVLM   
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLES----TSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVF

Query:  YGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDL
        YG + VT+F+CS+F +  SGS K L+  +VP+ F W+  F +L   V+ E+ +  + G   A+++++ V+   K+LH +   +       EE  NL   +
Subjt:  YGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDL

Query:  RREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGKDQKHEFS
            E++    D + ++ D    + L+        +  SE++ ++ T   + K + S
Subjt:  RREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGKDQKHEFS

Q337C0 UPF0496 protein 41.7e-6439.5Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ + +L+ SWM LA+  L E H ++  LI +L  PVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLES----TSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVF
        LN SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S  Q  +A  SL  W + +  R +R+ SCS  L  L  +L L KVKNS+KGKVLM   
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLES----TSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVF

Query:  YGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDL
        YG + VT+F+CS+F +  SGS K L+  +VP+ F W+  F +L   V+ E+ +  S G   A+++++ V+   ++LH +   +       EE  NL   +
Subjt:  YGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDL

Query:  RREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGKDQKHEFS
            E++    D + ++ D    + L+        +  SE++ +  T+  + K + S
Subjt:  RREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGKDQKHEFS

Q9LMM6 Protein BPS1, chloroplastic2.1e-9452.16Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++ IL+ SWM+ AM+ LCETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S     +A+SSLD+W QHI S+  R+E+C  IL  L ++L+LPKVKNS+KGKVLM   YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA  F E+Q  +N EIK I+ S   T L++++AV   VKKL+ +I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGK
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+      Q T  K
Subjt:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGK

Arabidopsis top hitse value%identityAlignment
AT1G01550.1 Protein of unknown function (DUF793)1.5e-9552.16Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++ IL+ SWM+ AM+ LCETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S     +A+SSLD+W QHI S+  R+E+C  IL  L ++L+LPKVKNS+KGKVLM   YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA  F E+Q  +N EIK I+ S   T L++++AV   VKKL+ +I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGK
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+      Q T  K
Subjt:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGK

AT1G01550.2 Protein of unknown function (DUF793)1.5e-9552.16Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++ IL+ SWM+ AM+ LCETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S     +A+SSLD+W QHI S+  R+E+C  IL  L ++L+LPKVKNS+KGKVLM   YG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA  F E+Q  +N EIK I+ S   T L++++AV   VKKL+ +I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLH-SIIQGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGK
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+      Q T  K
Subjt:  AEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGK

AT2G46080.1 CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511)1.9e-10656.21Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQ+P R FFPFGNPFR +S KGS LS  L+ LL  FE  L ERLKKL PK+++ IL+ SWM+LAM+ LCETH ++ TLI +L  PVSDW+EKW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK
        LNISV+LLD+CN FSSEL++LNQG L L+C LHNL+S S  +++QARSSLD+W QH+++   R+E+C  +LD L +SL LPKVKNS KGKVLM  FYG K
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVKNSSKGKVLMHVFYGAK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA
        V T++ICSVF +A+S S+K L    V +   WA VFT++Q  VN EI+ + SSGR T L+++++VD SV+KL+ +IQ  +D  ++VE F++ +++L  +A
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETV
        EKL+QG+D L ++VD FF + L GRD LL NLR+S+++
Subjt:  EKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETV

AT3G61500.1 unknown protein9.4e-2640.12Show/hide
Query:  FRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEK--WLDEYLNISVKLLDICNDFS
        FR + PK +        LL  FE SL ERLKKL P++ + IL+  WM LAM+LL +THND+  LI +L        E   W + Y+NI+ KLLD+CN F 
Subjt:  FRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEK--WLDEYLNISVKLLDICNDFS

Query:  SELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPK
        S L  +N G + L+   H LE  S     +  S+LD+W ++I   T+ +  C  +L R  ESL+  K
Subjt:  SELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPK

AT4G01360.1 unknown protein3.7e-5437.39Show/hide
Query:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICND
        NPF+ +  K   ++LS +L+ LL  FE +L   +++L PK +N I+S SWM  AM+ LCETH  ++TL+++L  PVSD +E ++  + + S+K  ++CN 
Subjt:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICND

Query:  FSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLD-------LPKVKNSSKGKVLMHVFYGAKVVTLFI
        F+SE+  L  G+L+L+ A   LE+ S        + L  WNQH+ S+   +E+   +L RL ES+D         K K S++GKVL+ V YG KV TL+I
Subjt:  FSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLD-------LPKVKNSSKGKVLMHVFYGAKVVTLFI

Query:  CSVFASAFSGSSKRLLPTNVP---DTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREAEKL
         SVF ++FSGSSK L    +P   +   W   F ELQ  +N EIK  + S  FT ++D++AV+  VKKL++ +Q      + VE  +  +++L    E +
Subjt:  CSVFASAFSGSSKRLLPTNVP---DTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIVDLRREAEKL

Query:  TQGVDHLTKQVDEFFHIVLSGRDELLSNL
        ++    L+K       +V+S RD LL +L
Subjt:  TQGVDHLTKQVDEFFHIVLSGRDELLSNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTGATTTTCCATGTGATGGTGGCATGATGGATGGTTATCATGAACTTGTTTCTAAATGGCAGCTCATTCTTGGATTTTATCCATCTTGCAGTTGGTGTTATATAAT
GAGCAGACCACAAGAGCCGCACCGGCCCTTCTTTCCTTTTGGGAATCCTTTCCGCGCAATATCACCAAAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTTTTAGCTA
CTTTTGAGGATTCTTTGGCAGAGAGGCTGAAAAAGCTTACTCCAAAATCGGAGAATCACATACTCAGCTTCTCATGGATGGAATTAGCAATGAAGCTGCTGTGTGAAACC
CACAATGATGTAAAAACCCTTATAGAAGAGCTTGGGTTCCCTGTGTCTGACTGGGATGAGAAATGGCTAGATGAGTACCTGAACATCAGTGTGAAATTACTTGATATATG
CAATGATTTTAGCTCTGAGCTCTCCCAGTTGAATCAAGGTCATCTGGTACTTCGGTGTGCCTTGCACAATCTGGAATCTACATCTTCCAGCCAGTTTGTTCAGGCCCGTT
CTTCGCTAGATGCATGGAATCAACATATTAGTTCCAGAACCTCCAGAGTTGAGAGCTGTTCTCCTATTTTGGACCGTCTTGAGGAATCACTTGATCTTCCAAAAGTTAAG
AACTCATCCAAAGGCAAGGTTTTGATGCATGTGTTTTATGGAGCGAAGGTGGTGACTTTGTTTATTTGCAGTGTTTTTGCTTCTGCCTTCTCAGGTTCTTCCAAAAGGTT
GTTACCCACCAATGTTCCAGATACATTCAGATGGGCGCACGTGTTTACTGAATTACAGAAAAATGTAAATATGGAAATTAAAAAAATTTATTCTAGTGGAAGGTTTACTG
CGTTGAGAGATGTTGATGCAGTTGATGAGAGTGTAAAAAAACTGCATTCCATAATTCAAGGAAATATGGATGGCTGCATGAAAGTGGAAGAATTCCAGAATTTGATTGTA
GATTTGAGGAGGGAAGCAGAGAAGCTTACACAAGGTGTTGATCATCTTACAAAACAAGTTGATGAGTTTTTTCACATTGTTTTATCTGGACGGGATGAATTGCTTTCAAA
TCTTAGAGCAAGTGAAACAGTATTTGATCAGGTGACAGAAGGAAAAGATCAGAAGCATGAGTTTTCATGCAGTGGTGCAGAATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTGATTTTCCATGTGATGGTGGCATGATGGATGGTTATCATGAACTTGTTTCTAAATGGCAGCTCATTCTTGGATTTTATCCATCTTGCAGTTGGTGTTATATAAT
GAGCAGACCACAAGAGCCGCACCGGCCCTTCTTTCCTTTTGGGAATCCTTTCCGCGCAATATCACCAAAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTTTTAGCTA
CTTTTGAGGATTCTTTGGCAGAGAGGCTGAAAAAGCTTACTCCAAAATCGGAGAATCACATACTCAGCTTCTCATGGATGGAATTAGCAATGAAGCTGCTGTGTGAAACC
CACAATGATGTAAAAACCCTTATAGAAGAGCTTGGGTTCCCTGTGTCTGACTGGGATGAGAAATGGCTAGATGAGTACCTGAACATCAGTGTGAAATTACTTGATATATG
CAATGATTTTAGCTCTGAGCTCTCCCAGTTGAATCAAGGTCATCTGGTACTTCGGTGTGCCTTGCACAATCTGGAATCTACATCTTCCAGCCAGTTTGTTCAGGCCCGTT
CTTCGCTAGATGCATGGAATCAACATATTAGTTCCAGAACCTCCAGAGTTGAGAGCTGTTCTCCTATTTTGGACCGTCTTGAGGAATCACTTGATCTTCCAAAAGTTAAG
AACTCATCCAAAGGCAAGGTTTTGATGCATGTGTTTTATGGAGCGAAGGTGGTGACTTTGTTTATTTGCAGTGTTTTTGCTTCTGCCTTCTCAGGTTCTTCCAAAAGGTT
GTTACCCACCAATGTTCCAGATACATTCAGATGGGCGCACGTGTTTACTGAATTACAGAAAAATGTAAATATGGAAATTAAAAAAATTTATTCTAGTGGAAGGTTTACTG
CGTTGAGAGATGTTGATGCAGTTGATGAGAGTGTAAAAAAACTGCATTCCATAATTCAAGGAAATATGGATGGCTGCATGAAAGTGGAAGAATTCCAGAATTTGATTGTA
GATTTGAGGAGGGAAGCAGAGAAGCTTACACAAGGTGTTGATCATCTTACAAAACAAGTTGATGAGTTTTTTCACATTGTTTTATCTGGACGGGATGAATTGCTTTCAAA
TCTTAGAGCAAGTGAAACAGTATTTGATCAGGTGACAGAAGGAAAAGATCAGAAGCATGAGTTTTCATGCAGTGGTGCAGAATTCTAG
Protein sequenceShow/hide protein sequence
MIDFPCDGGMMDGYHELVSKWQLILGFYPSCSWCYIMSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENHILSFSWMELAMKLLCET
HNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICNDFSSELSQLNQGHLVLRCALHNLESTSSSQFVQARSSLDAWNQHISSRTSRVESCSPILDRLEESLDLPKVK
NSSKGKVLMHVFYGAKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHVFTELQKNVNMEIKKIYSSGRFTALRDVDAVDESVKKLHSIIQGNMDGCMKVEEFQNLIV
DLRREAEKLTQGVDHLTKQVDEFFHIVLSGRDELLSNLRASETVFDQVTEGKDQKHEFSCSGAEF