; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G01900 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G01900
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionprotein BPS1, chloroplastic-like
Genome locationClcChr05:1321073..1324710
RNA-Seq ExpressionClc05G01900
SyntenyClc05G01900
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR008511 - Protein BYPASS-related


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019020.1 Protein BPS1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]5.6e-15782.86Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA++L KLTPKS+ND+ SF WM LA+KLL E HNDVKTLI +L  PVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L+ISVKLLDICNDFSS+LSQLNQGHLILRCALHNL STSSN FV ARSSLDAWNQHISSRTSRVE+ +PI+D  EE LDLPKVKNS KGKVLM V+YGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFA AFSGSSKRLL  NVPDT+RWA AFTELQKNVNM IK  +SSGRFTALRD++AVD  VKKLHSMIQ N+DG MKVEE QNLIVDLR EA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQ
        EKLTQGVD LTKQVDEFF+IVLSGRDALLSNLR+SET FD+G+  LSTRQ
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQ

XP_004133852.1 protein BPS1, chloroplastic [Cucumis sativus]8.3e-16987.75Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHNDVKTL+EELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+SC PILD L ESLDLPKVKNSSKGKVLMHVLY VK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRA+E  F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

XP_008438008.1 PREDICTED: protein BPS1, chloroplastic-like [Cucumis melo]4.9e-16988.32Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+S +PILD L+ESLDLPKVKNSSKGKVLMH LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRASET F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

XP_022147420.1 protein BPS1, chloroplastic-like isoform X1 [Momordica charantia]1.5e-16283.81Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELA+KLL ETHNDVKTLIE+LG PVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L ISVKLLDICNDFSSELSQLNQG L+LRCALHNLESTSSN FV A SSLD W++HISSRTSR ESC  ILD LEESLD PKVKNSSKGKVLMHV+YGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFAS FSGSSK+L P  VPDT+RWA AFT+LQKNVN+ IK IYSSGRFT LR+++AVD SV  LHSMIQGN+DG +K EEFQNL+VD RREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLS-TRQL
        EKL+QG+D LTKQVDEFF IVLSGRDALLSNLRASET FD+G+G LS TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLS-TRQL

XP_038877787.1 protein BPS1, chloroplastic-like [Benincasa hispida]2.2e-17792.02Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPH PFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELA+ LL ETHNDVKTLIEELGFP SDWDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LN+SVKLLDICNDFSSELSQLNQGHLI+RCALHNLESTSS+ FVHA SSLDAWNQHISSRTSRVES + ILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICS FASA SGSSKRLLPTNV DTFRWAHAFTELQKNVNMEIK IYSSGR TALRDVDAVD SVKKLHSMIQGNMDGCMKVEEFQ LIVDLRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        EKLTQGVD LTKQVD FFHIVLSGRDALLSNLRASET FD+G+   STRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

TrEMBL top hitse value%identityAlignment
A0A0A0L3G5 Uncharacterized protein4.0e-16987.75Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHNDVKTL+EELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+SC PILD L ESLDLPKVKNSSKGKVLMHVLY VK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRA+E  F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

A0A1S3AVG0 protein BPS1, chloroplastic-like2.4e-16988.32Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+S +PILD L+ESLDLPKVKNSSKGKVLMH LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRASET F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

A0A5D3D1K6 Protein BPS12.4e-16988.32Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELA+KLL ETHN+VKTLIEELGFPVS+WDEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSN  V A SSLDAWNQHISSRTSRV+S +PILD L+ESLDLPKVKNSSKGKVLMH LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        VVTLFICSVFAS+FSGSS+ LLPTNVPD+FRWA AFTELQK VNMEIK IYSSGRFTALRDVDAV+  VKKLHSMIQGNMD C   EEFQNLIV+LRREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL
        E LTQGVD LTKQVDEFFHIVLSGRD LLSNLRASET F +G+G L TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQL

A0A6J1D2B3 protein BPS1, chloroplastic-like isoform X17.4e-16383.81Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELA+KLL ETHNDVKTLIE+LG PVSDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L ISVKLLDICNDFSSELSQLNQG L+LRCALHNLESTSSN FV A SSLD W++HISSRTSR ESC  ILD LEESLD PKVKNSSKGKVLMHV+YGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFAS FSGSSK+L P  VPDT+RWA AFT+LQKNVN+ IK IYSSGRFT LR+++AVD SV  LHSMIQGN+DG +K EEFQNL+VD RREA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLS-TRQL
        EKL+QG+D LTKQVDEFF IVLSGRDALLSNLRASET FD+G+G LS TRQL
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLS-TRQL

A0A6J1E9E6 protein BPS1, chloroplastic-like isoform X21.3e-15682.29Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLATFEDSLA++L+KLTPKS++D+ SF WM LA+KLL E HNDVKTLI +L  PVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L+ISVKLLDICNDFSS+LSQLNQGHLILRCALHNL STSSN FV ARSSLDAWNQHISSRTSRVE+ +PI+D  EE LDLPKVKNS KGKVLM V+YGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        V TLFICSVFA AFSGSSKRLL  NVPDT+RWA AF ELQKNVNM IK  +SSGRFTALRD++AVD  VKKLHSMIQ N+DG MKVEE QNLIVDLR EA
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQ
        EKLTQGVD LTKQVDEFF+IVLSGRDALLSNLR+SET FD+G+  LSTRQ
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIGVLSTRQ

SwissProt top hitse value%identityAlignment
A2Z9A6 UPF0496 protein 44.0e-6536.1Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+L+ SWM LA+  L E H ++  LI +L  PVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLES----TSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVL
        LN SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S      A  SL  W + +  R  R+ SC+  L  L  +L L KVKNS KGKVLM  L
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLES----TSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVL

Query:  YGVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLH-----------------------
        YG++ VT+F+CS+F +  SGS K L+  +VP+ F W+ AF +L   V+ E+    + G   A+++++ V+   K+LH                       
Subjt:  YGVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLH-----------------------

Query:  ------SMIQ-----------------------------GNMDGCMK------------------------------------------------VEEFQ
              S++Q                             G  +  MK                                                 EE  
Subjt:  ------SMIQ-----------------------------GNMDGCMK------------------------------------------------VEEFQ

Query:  NLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASE
        N I  + + AE L  G+D L+K+V +FF IVL+GRDALL NLR S+
Subjt:  NLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASE

Q337C0 UPF0496 protein 41.4e-6535.65Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+L+ SWM LA+  L E H ++  LI +L  PVSDWD+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLES----TSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVL
        LN SVKLLDIC   SSELS+L+QG L+L+ ALH L S     S      A  SL  W + +  R +R+ SC+  L  L  +L L KVKNS+KGKVLM  L
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLES----TSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVL

Query:  YGVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQ-------------------
        YG++ VT+F+CS+F +  SGS K L+  +VP+ F W+ AF +L   V+ E+    S G   A+++++ V+   ++LH +                     
Subjt:  YGVKVVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQ-------------------

Query:  ---------------------------------------GNMDGCMK------------------------------------------------VEEFQ
                                               G  +  MK                                                 EE  
Subjt:  ---------------------------------------GNMDGCMK------------------------------------------------VEEFQ

Query:  NLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASE
        N I  + + AE L  G+D L+K+V +FF IVL+GRDALL NLR S+
Subjt:  NLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASE

Q9LMM6 Protein BPS1, chloroplastic6.3e-9553.73Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++DIL+ SWM+ A++ L ETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S      A+SSLD+W QHI S+  R+E+C  IL  L ++L+LPKVKNS+KGKVLM  LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA +F E+Q  +N EIKNI+ S   T L++++AV   VKKL+  I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA

Arabidopsis top hitse value%identityAlignment
AT1G01550.1 Protein of unknown function (DUF793)4.5e-9653.73Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++DIL+ SWM+ A++ L ETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S      A+SSLD+W QHI S+  R+E+C  IL  L ++L+LPKVKNS+KGKVLM  LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA +F E+Q  +N EIKNI+ S   T L++++AV   VKKL+  I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA

AT1G01550.2 Protein of unknown function (DUF793)4.5e-9653.73Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK ++DIL+ SWM+ A++ L ETHN +KTLI +L  PVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        L+ISVKLLD+CN FSSEL++LNQGHL+L+ ALHNLE+ S      A+SSLD+W QHI S+  R+E+C  IL  L ++L+LPKVKNS+KGKVLM  LYGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE
        V TL+I  VFA+AFSGSS+ L+   V +   WA +F E+Q  +N EIKNI+ S   T L++++AV   VKKL+  I QG++D           +  L+  
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMI-QGNMDGCMKVEEFQNLIVDLRRE

Query:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA
          +L+ G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  AEKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRA

AT2G46080.1 CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511)2.4e-10554.94Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY
        MSRPQ+P R FFPFGNPFR +S KGS LS  L+ LL  FE  L ERLKKL PK+++DIL+ SWM+LA++ L ETH ++ TLI +L  PVSDW+EKW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEY

Query:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK
        LNISV+LLD+CN FSSEL++LNQG L L+C LHNL+S S   ++ ARSSLD+W QH+++   R+E+C  +LD L +SL LPKVKNS KGKVLM   YGVK
Subjt:  LNISVKLLDICNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVK

Query:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA
        V T++ICSVF +A+S S+K L    V +   WA  FT++Q  VN EI+++ SSGR T L+++++VD SV+KL+ MIQ  +D  ++VE F++ +++L  +A
Subjt:  VVTLFICSVFASAFSGSSKRLLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREA

Query:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIG
        EKL+QG+D+L ++VD FF + L GRD LL NLR+S++     +G
Subjt:  EKLTQGVDRLTKQVDEFFHIVLSGRDALLSNLRASETEFDRGIG

AT3G61500.1 unknown protein1.7e-2640.12Show/hide
Query:  FRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEK--WLDEYLNISVKLLDICNDFS
        FR + PK +        LL  FE SL ERLKKL P++ ++IL+  WM LA++LLY+THND+  LI +L        E   W + Y+NI+ KLLD+CN F 
Subjt:  FRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEK--WLDEYLNISVKLLDICNDFS

Query:  SELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPK
        S L  +N G + L+   H LE  S +L     S+LD+W ++I   T+ +  C  +L    ESL+  K
Subjt:  SELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPK

AT4G01360.1 unknown protein2.7e-5638.6Show/hide
Query:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICND
        NPF+ +  K   ++LS +L+ LL  FE +L   +++L PK +NDI+S SWM  A++ L ETH  ++TL+++L  PVSD +E ++  + + S+K  ++CN 
Subjt:  NPFRAISPK--GSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDICND

Query:  FSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLD-------LPKVKNSSKGKVLMHVLYGVKVVTLFI
        F+SE+  L  G+L+L+ A   LE+ S N+       L  WNQH+ S+   +E+   +L  L ES+D         K K S++GKVL+ VLYGVKV TL+I
Subjt:  FSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLD-------LPKVKNSSKGKVLMHVLYGVKVVTLFI

Query:  CSVFASAFSGSSKRLLPTNVP---DTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREAEKL
         SVF ++FSGSSK L    +P   +   W  AF ELQ  +N EIKN + S  FT ++D++AV+  VKKL++ +Q      + VE  +  +++L    E +
Subjt:  CSVFASAFSGSSKRLLPTNVP---DTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREAEKL

Query:  TQGVDRLTKQVDEFFHIVLSGRDALLSNL
        ++    L+K       +V+S RDALL +L
Subjt:  TQGVDRLTKQVDEFFHIVLSGRDALLSNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAGACCACAAGAGCCGCACCGGCCCTTCTTTCCTTTTGGGAATCCTTTCCGTGCAATATCACCAAAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTTTTAGC
TACTTTTGAGGATTCCTTGGCAGAGAGGCTGAAAAAGCTTACCCCAAAATCGGAGAATGACATACTCAGCTTCTCATGGATGGAATTAGCATTGAAGCTGCTGTACGAAA
CTCACAACGATGTAAAAACCCTTATAGAAGAGCTTGGGTTCCCCGTATCTGATTGGGATGAGAAATGGCTAGATGAGTACCTGAACATCAGTGTGAAATTACTTGATATA
TGCAATGATTTTAGCTCTGAGCTCTCACAGTTGAATCAAGGTCATCTGATACTTCGGTGTGCCTTGCACAATCTGGAATCTACATCTTCCAACCTGTTTGTTCATGCCCG
TTCTTCGCTAGATGCATGGAATCAACATATTAGTTCCAGAACCTCCAGAGTTGAAAGCTGTGCTCCTATTTTGGACTGTCTTGAGGAATCACTTGATCTTCCAAAGGTTA
AGAACTCATCCAAAGGCAAGGTTTTGATGCATGTGTTGTACGGAGTGAAGGTGGTGACTTTGTTTATTTGCAGTGTTTTTGCTTCTGCCTTCTCAGGTTCTTCCAAAAGG
TTGTTACCCACCAATGTTCCAGATACATTCAGATGGGCACACGCATTTACTGAATTACAGAAAAATGTAAATATGGAAATTAAGAATATTTATTCTAGTGGAAGGTTTAC
TGCGTTGAGAGATGTTGATGCAGTTGATGGGAGTGTAAAAAAACTACATTCCATGATTCAAGGTAATATGGATGGCTGCATGAAAGTGGAAGAATTCCAGAATTTGATTG
TAGATTTGAGGAGGGAGGCAGAGAAGCTTACACAAGGTGTTGATCGTCTTACAAAACAAGTTGATGAGTTTTTTCACATTGTTTTATCTGGACGTGATGCATTGCTTTCA
AATCTTAGAGCAAGTGAAACAGAATTTGATCGGGGAATTGGGGTGCTGTCTACAAGGCAACTGTGA
mRNA sequenceShow/hide mRNA sequence
ACATCCAATCAATTCCCTCCACGTTTCTACCTTTCGCATAATTTGCGTACCTCTCCTGTTAGGTTATCACCGGCTTTATCATCATCAATTCACAAATACAGCTCCTCTCA
TTGCCACCACGCGAAAATTCAACGTTTTCTGGCTGCGATACTTTCTTAGAAATTTCCTTTGCTTTCTTGTTCCTGAATCTGTTAGTGTTATACAATGAGCAGACCACAAG
AGCCGCACCGGCCCTTCTTTCCTTTTGGGAATCCTTTCCGTGCAATATCACCAAAGGGTTCAAAATTGTCTTCTAGACTTGTTTTTCTTTTAGCTACTTTTGAGGATTCC
TTGGCAGAGAGGCTGAAAAAGCTTACCCCAAAATCGGAGAATGACATACTCAGCTTCTCATGGATGGAATTAGCATTGAAGCTGCTGTACGAAACTCACAACGATGTAAA
AACCCTTATAGAAGAGCTTGGGTTCCCCGTATCTGATTGGGATGAGAAATGGCTAGATGAGTACCTGAACATCAGTGTGAAATTACTTGATATATGCAATGATTTTAGCT
CTGAGCTCTCACAGTTGAATCAAGGTCATCTGATACTTCGGTGTGCCTTGCACAATCTGGAATCTACATCTTCCAACCTGTTTGTTCATGCCCGTTCTTCGCTAGATGCA
TGGAATCAACATATTAGTTCCAGAACCTCCAGAGTTGAAAGCTGTGCTCCTATTTTGGACTGTCTTGAGGAATCACTTGATCTTCCAAAGGTTAAGAACTCATCCAAAGG
CAAGGTTTTGATGCATGTGTTGTACGGAGTGAAGGTGGTGACTTTGTTTATTTGCAGTGTTTTTGCTTCTGCCTTCTCAGGTTCTTCCAAAAGGTTGTTACCCACCAATG
TTCCAGATACATTCAGATGGGCACACGCATTTACTGAATTACAGAAAAATGTAAATATGGAAATTAAGAATATTTATTCTAGTGGAAGGTTTACTGCGTTGAGAGATGTT
GATGCAGTTGATGGGAGTGTAAAAAAACTACATTCCATGATTCAAGGTAATATGGATGGCTGCATGAAAGTGGAAGAATTCCAGAATTTGATTGTAGATTTGAGGAGGGA
GGCAGAGAAGCTTACACAAGGTGTTGATCGTCTTACAAAACAAGTTGATGAGTTTTTTCACATTGTTTTATCTGGACGTGATGCATTGCTTTCAAATCTTAGAGCAAGTG
AAACAGAATTTGATCGGGGAATTGGGGTGCTGTCTACAAGGCAACTGTGACGGAGAGCTTGTTCCTTTATATAAAAGGCTTCAACTGGTTATTTTACTATCTTTTGATTG
GCGATTGTACACTATCTTGTCTTACTTGTCATAGTATAGCCTGTGTAGTAATGCCTTGTGTTATCTTAATGTCATTGTGGTTTGAGCGTGTAATGTTGTACTTGGCAATG
AGAATGAAATGGAAAAGTATATCATGTTCAAGTAAAAGGCCTGCACCACTTCATCCTTCTTTATCGGTCACACTATAATTAAACTTTATTAGTTCCTTCTACTCAGTTTC
TACCTTGTTAATATCATATGCTGAAATGAATAATCCATATATATTGTTTGTGTTATGTTGGTATTCTGTATTGAATTGTTTTAGATATGTGGATACTGTGCTGTATGGAG
TTGGACTAGGAACTTACTCAATCCTCTTGTAGTCCTTCAGAATTCGAGATTATAATCTGTTGTAAAAGCTTACTTTCTGGACAATGTACTGGCTATATTTACAGTCATTG
GATTATGATCTCATGTCTCTTAATCTTTGTTTACTGATGTAGTGACAGAAGGAAAAGATCAGAAGCATGAGTTTTCATGCAGTGCAGAATTCTAGAACGCCAATCTTGTC
AGTGAACACTCGCGAGTTAGCTGTACAAATTGGCACTAGAACTGTGCTGGAAAGGCAAATACACGTCGTACCAAACATTTTTAGAAGGGTTTAATACTAACAGAAACAAT
ATACCGACTGGATATAGCGTTTGCTGGTACACAGCAAGTAAGCTTCATGCATGACTGAATTGGTAAGAGCATTTTTTCAACTCGACTGCATCAATTGGTATAGTATCCTC
TGGAAATGTGCA
Protein sequenceShow/hide protein sequence
MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLATFEDSLAERLKKLTPKSENDILSFSWMELALKLLYETHNDVKTLIEELGFPVSDWDEKWLDEYLNISVKLLDI
CNDFSSELSQLNQGHLILRCALHNLESTSSNLFVHARSSLDAWNQHISSRTSRVESCAPILDCLEESLDLPKVKNSSKGKVLMHVLYGVKVVTLFICSVFASAFSGSSKR
LLPTNVPDTFRWAHAFTELQKNVNMEIKNIYSSGRFTALRDVDAVDGSVKKLHSMIQGNMDGCMKVEEFQNLIVDLRREAEKLTQGVDRLTKQVDEFFHIVLSGRDALLS
NLRASETEFDRGIGVLSTRQL