; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0372 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0372
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein BPS1, chloroplastic-like
Genome locationMC05:2802851..2807860
RNA-Seq ExpressionMC05g0372
SyntenyMC05g0372
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004320 - Protein of unknown function DUF241, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019020.1 Protein BPS1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]1.82e-19381.03Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLA FEDSLA++L KLTPKSDND+ SF WM LAMKLLCE HNDVKTLI DL LPVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLDICNDFSS+LSQLNQG L+LRCALHNL STSSNQFV A SSLD W++HISSRTSR E+   I+D  EE LD PKVKNS KGKVLM VMYGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V TLFICSVFA  FSGSSK+L  I VPDTYRWAQAFT+LQKNVN+GIK  +SSGRFT LR+L AVDE V  LHSMIQ NIDGG+K EE QNL+VD R EA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLST
        EKL+QG+DHLTKQVDEFF IVLSGRDALLSNLR+SETVFDQGM GLST
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLST

XP_004133852.1 protein BPS1, chloroplastic [Cucumis sativus]3.20e-19379.83Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLA FEDSLAERLKKLTPKS+NDI+SFSWMELAMKLL ETHNDVKTL+E+LG PVS+W+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLDICNDFSSELSQLNQG L+LRCALHNLESTSSNQ V+A SSLD W++HISSRTSR +SC  ILDSL ESLD PKVKNSSKGKVLMHV+Y VK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V TLFICSVFAS FSGSS+ L P  VPD++RWA AFT+LQK VN+ IKKIYSSGRFT LR+++AV+E V  LHSMIQGN+D     EEFQNL+V+ RREA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
        E L+QG+DHLTKQVDEFF IVLSGRD LLSNLRA+E VF QGMGGL T RQL
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL

XP_008438008.1 PREDICTED: protein BPS1, chloroplastic-like [Cucumis melo]6.45e-19380.11Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELAMKLL ETHN+VKTLIE+LG PVS+W+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLDICNDFSSELSQLNQG L+LRCALHNLESTSSNQ V+A SSLD W++HISSRTSR +S   ILD L+ESLD PKVKNSSKGKVLMH +YGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V TLFICSVFAS FSGSS+ L P  VPD++RWA AFT+LQK VN+ IKKIYSSGRFT LR+++AV+E V  LHSMIQGN+D     EEFQNL+V+ RREA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
        E L+QG+DHLTKQVDEFF IVLSGRD LLSNLRASETVF QGMGGL T RQL
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL

XP_022147420.1 protein BPS1, chloroplastic-like isoform X1 [Momordica charantia]1.60e-251100Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
        EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL

XP_038877787.1 protein BPS1, chloroplastic-like [Benincasa hispida]2.32e-20082.39Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MSRPQEPH PFFPFGNPFRAISPKGSKLSSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELAM LLCETHNDVKTLIE+LG P SDW+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L +SVKLLDICNDFSSELSQLNQG L++RCALHNLESTSS+QFV A SSLD W++HISSRTSR ES  +ILD LEESLD PKVKNSSKGKVLMHV+YGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V TLFICS FAS  SGSSK+L P  V DT+RWA AFT+LQKNVN+ IKKIYSSGR T LR+++AVDESV  LHSMIQGN+DG +K EEFQ L+VD RREA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
        EKL+QG+DHLTKQVD FF IVLSGRDALLSNLRASETVFDQGM G ST RQL
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL

TrEMBL top hitse value%identityAlignment
A0A0A0L3G5 Uncharacterized protein1.55e-19379.83Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRL FLLA FEDSLAERLKKLTPKS+NDI+SFSWMELAMKLL ETHNDVKTL+E+LG PVS+W+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLDICNDFSSELSQLNQG L+LRCALHNLESTSSNQ V+A SSLD W++HISSRTSR +SC  ILDSL ESLD PKVKNSSKGKVLMHV+Y VK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V TLFICSVFAS FSGSS+ L P  VPD++RWA AFT+LQK VN+ IKKIYSSGRFT LR+++AV+E V  LHSMIQGN+D     EEFQNL+V+ RREA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
        E L+QG+DHLTKQVDEFF IVLSGRD LLSNLRA+E VF QGMGGL T RQL
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL

A0A1S3AVG0 protein BPS1, chloroplastic-like3.12e-19380.11Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELAMKLL ETHN+VKTLIE+LG PVS+W+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLDICNDFSSELSQLNQG L+LRCALHNLESTSSNQ V+A SSLD W++HISSRTSR +S   ILD L+ESLD PKVKNSSKGKVLMH +YGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V TLFICSVFAS FSGSS+ L P  VPD++RWA AFT+LQK VN+ IKKIYSSGRFT LR+++AV+E V  LHSMIQGN+D     EEFQNL+V+ RREA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
        E L+QG+DHLTKQVDEFF IVLSGRD LLSNLRASETVF QGMGGL T RQL
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL

A0A5D3D1K6 Protein BPS13.12e-19380.11Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MS+PQEPHRPFFPFGNPFRAISPKG+K+SSRLVFLLA FEDSLAERLKKLTPKS+NDI+SFSWMELAMKLL ETHN+VKTLIE+LG PVS+W+EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLDICNDFSSELSQLNQG L+LRCALHNLESTSSNQ V+A SSLD W++HISSRTSR +S   ILD L+ESLD PKVKNSSKGKVLMH +YGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V TLFICSVFAS FSGSS+ L P  VPD++RWA AFT+LQK VN+ IKKIYSSGRFT LR+++AV+E V  LHSMIQGN+D     EEFQNL+V+ RREA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
        E L+QG+DHLTKQVDEFF IVLSGRD LLSNLRASETVF QGMGGL T RQL
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL

A0A6J1D2B3 protein BPS1, chloroplastic-like isoform X17.73e-252100Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
        EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLSTTRQL

A0A6J1ISZ1 protein BPS1, chloroplastic-like2.75e-19280.46Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MS+PQEPHRPFF FGNPFRAISPKGSKLSSRLVFLLA FEDSLA++L KLTPKSDND+ SF WM LAMKLLCE HNDVKTLI DL LPVSDW EKWLDEY
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLDICNDFSS+LSQLNQG L+LRCALHNL STSSNQFV A  SLD W++HISSRTSR E+   I+D LEE LD PKVKNS KGKVLM VMYGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V TLFICSVFA  FSGSSK+L  I VPDTYRWAQAFT+LQKNVN+GIK  +SSGRFT LR L+AVDE V  LHSMIQ NIDGG+K EE QNL+VD R EA
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLST
        EKL+QG+DHLTKQVDEFF +VLSGRDALLSNLR+SETVF QGM GLST
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMGGLST

SwissProt top hitse value%identityAlignment
A2Z9A6 UPF0496 protein 44.4e-6436.1Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+++ SWM LA+  L E H ++  LI DL LPVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLES----TSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVM
        L  SVKLLDIC   SSELS+L+QGQL+L+ ALH L S     S  Q  +A  SL  W + +  R  R  SC + L  L  +L   KVKNS KGKVLM  +
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLES----TSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVM

Query:  YGVKVETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLH-----------------------
        YG++  T+F+CS+F +V SGS K L  + VP+ + W+QAF DL   V+  + +  + G    ++ELE V+     LH                       
Subjt:  YGVKVETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLH-----------------------

Query:  ------SMIQ-GNIDGGIK----------------------------------------------------------------------------AEEFQ
              S++Q G+   G+K                                                                             EE  
Subjt:  ------SMIQ-GNIDGGIK----------------------------------------------------------------------------AEEFQ

Query:  NLVVDSRREAEKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASE
        N +    + AE L  GLD L+K+V +FF IVL+GRDALL NLR S+
Subjt:  NLVVDSRREAEKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASE

Q337C0 UPF0496 protein 46.9e-6536.1Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MSR     R FFP GNPFR + P G+ LS +L  LLA++ED+LA  L+KL P++ +D+++ SWM LA+  L E H ++  LI DL LPVSDW++KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLES----TSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVM
        L  SVKLLDIC   SSELS+L+QGQL+L+ ALH L S     S  Q  +A  SL  W + +  R +R  SC + L  L  +L   KVKNS+KGKVLM  +
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLES----TSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVM

Query:  YGVKVETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQ--------GNI--------
        YG++  T+F+CS+F +V SGS K L  + VP+ + W+QAF DL   V+  + +  S G    ++ELE V+     LH +           N+        
Subjt:  YGVKVETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQ--------GNI--------

Query:  --------------------------DGGI----------------------------------------------------------------KAEEFQ
                                  +GGI                                                                  EE  
Subjt:  --------------------------DGGI----------------------------------------------------------------KAEEFQ

Query:  NLVVDSRREAEKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASE
        N +    + AE L  GLD L+K+V +FF IVL+GRDALL NLR S+
Subjt:  NLVVDSRREAEKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASE

Q9LMM6 Protein BPS1, chloroplastic3.2e-9454.79Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK  +DI++ SWM+ AM+ LCETHN +KTLI DL LPVSDWE+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLD+CN FSSEL++LNQG L+L+ ALHNLE+ S     +A SSLD W +HI S+  R E+C +IL SL ++L+ PKVKNS+KGKVLM  +YGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V+TL+I  VFA+ FSGSS+ L  +TV +   WAQ+F ++Q  +N  IK I+ S   TVL+ELEAV   V  L+  IQ    G I     Q L    +   
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRA
         +LS G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRA

Arabidopsis top hitse value%identityAlignment
AT1G01550.1 Protein of unknown function (DUF793)2.2e-9554.79Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK  +DI++ SWM+ AM+ LCETHN +KTLI DL LPVSDWE+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLD+CN FSSEL++LNQG L+L+ ALHNLE+ S     +A SSLD W +HI S+  R E+C +IL SL ++L+ PKVKNS+KGKVLM  +YGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V+TL+I  VFA+ FSGSS+ L  +TV +   WAQ+F ++Q  +N  IK I+ S   TVL+ELEAV   V  L+  IQ    G I     Q L    +   
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRA
         +LS G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRA

AT1G01550.2 Protein of unknown function (DUF793)2.2e-9554.79Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        M+RPQ+P R FFPFGNPF+ +S K S LSS+L+ LL  FE +LA  + KL PK  +DI++ SWM+ AM+ LCETHN +KTLI DL LPVSDWE+KW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISVKLLD+CN FSSEL++LNQG L+L+ ALHNLE+ S     +A SSLD W +HI S+  R E+C +IL SL ++L+ PKVKNS+KGKVLM  +YGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V+TL+I  VFA+ FSGSS+ L  +TV +   WAQ+F ++Q  +N  IK I+ S   TVL+ELEAV   V  L+  IQ    G I     Q L    +   
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRA
         +LS G+D ++K+VD FF I+LSGRD LL NLR+
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRA

AT2G46080.1 CONTAINS InterPro DOMAIN/s: Protein BYPASS related (InterPro:IPR008511)5.1e-10856.4Show/hide
Query:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY
        MSRPQ+P R FFPFGNPFR +S KGS LS  L+ LL  FE  L ERLKKL PK+ +DI++ SWM+LAM+ LCETH ++ TLI DL LPVSDWEEKW+D Y
Subjt:  MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEY

Query:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK
        L ISV+LLD+CN FSSEL++LNQG L L+C LHNL+S S  +++QA SSLD W +H+++   R E+C ++LDSL +SL  PKVKNS KGKVLM   YGVK
Subjt:  LGISVKLLDICNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVK

Query:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA
        V+T++ICSVF + +S S+K LF + V +   WA+ FTD+Q  VN  I+ + SSGR T+L+ELE+VD SV  L+ MIQ  +D  ++ E F++ V++   +A
Subjt:  VETLFICSVFASVFSGSSKKLFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREA

Query:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMG
        EKLSQGLD L ++VD FF + L GRD LL NLR+S+++    +G
Subjt:  EKLSQGLDHLTKQVDEFFDIVLSGRDALLSNLRASETVFDQGMG

AT3G61500.1 unknown protein1.4e-2540.85Show/hide
Query:  FRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLP-VSDWEE-KWLDEYLGISVKLLDICNDFS
        FR + PK +        LL  FE SL ERLKKL P++ ++I++  WM LAM+LL +THND+  LI DL L  ++D E   W + Y+ I+ KLLD+CN F 
Subjt:  FRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLP-VSDWEE-KWLDEYLGISVKLLDICNDFS

Query:  SELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLD
        S L  +N G + L+   H LE  S +   + CS+LD W ++I++  S+   C  +L    ESL+
Subjt:  SELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLD

AT4G01360.1 unknown protein3.9e-5538.91Show/hide
Query:  NPFRAISPK--GSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEYLGISVKLLDICND
        NPF+ +  K   ++LS +L+ LL  FE +L   +++L PK  NDIVS SWM  AM+ LCETH  ++TL++DL +PVSD EE ++  +   S+K  ++CN 
Subjt:  NPFRAISPK--GSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEYLGISVKLLDICND

Query:  FSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLD-------PPKVKNSSKGKVLMHVMYGVKVETLFI
        F+SE+  L  G L+L+ A   LE+ S N        L  W++H+ S+    E+  ++L  L ES+D         K K S++GKVL+ V+YGVKV+TL+I
Subjt:  FSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLD-------PPKVKNSSKGKVLMHVMYGVKVETLFI

Query:  CSVFASVFSGSSKKLFPITVP---DTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREAEKL
         SVF + FSGSSK LF +T+P   +   W QAF +LQ  +N  IK  + S  FTV+++LEAV+  V  L++ +Q      +  E  +  V++       L
Subjt:  CSVFASVFSGSSKKLFPITVP---DTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREAEKL

Query:  SQGLDHLTKQVDEFFDIVLSGRDALLSNL
        S+  + ++K+      +V+S RDALL +L
Subjt:  SQGLDHLTKQVDEFFDIVLSGRDALLSNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCGACCACAAGAGCCACACCGGCCTTTCTTTCCTTTCGGAAATCCTTTTCGTGCAATATCACCTAAGGGTTCCAAATTGTCCTCTAGACTTGTCTTTTTGTTAGC
CGCTTTTGAGGATTCACTGGCAGAGAGGCTGAAAAAGCTTACTCCAAAATCAGATAATGATATAGTTAGCTTCTCATGGATGGAATTAGCAATGAAGCTGCTGTGCGAGA
CTCACAATGATGTTAAAACACTTATAGAAGACCTTGGTCTCCCTGTATCTGACTGGGAGGAGAAATGGTTAGATGAGTATTTGGGCATCAGTGTGAAATTACTTGATATA
TGCAATGATTTTAGCTCTGAGCTCTCACAGTTGAATCAGGGTCAACTGGTACTTCGGTGTGCCTTGCACAATCTGGAATCTACATCTTCCAACCAGTTTGTTCAAGCTTG
TTCTTCGTTAGATGGATGGAGTAAACATATTAGTTCCAGAACCTCCAGATTCGAGAGCTGTTGTTCTATTCTGGACAGTCTTGAGGAATCGCTCGATCCTCCGAAGGTGA
AGAACTCATCCAAAGGCAAGGTTCTGATGCATGTGATGTACGGAGTGAAGGTGGAAACTCTGTTTATTTGCAGTGTTTTTGCTTCTGTCTTCTCAGGTTCTTCCAAAAAG
TTGTTTCCTATCACTGTTCCTGATACATATAGATGGGCACAAGCTTTTACTGACTTACAGAAAAATGTAAACGTGGGAATTAAGAAAATTTATTCTAGTGGACGATTTAC
TGTATTGAGAGAGCTTGAAGCAGTTGATGAGAGTGTAACAAATTTGCATTCCATGATTCAAGGAAATATAGATGGTGGAATTAAAGCAGAAGAATTCCAGAATTTGGTTG
TAGATTCGAGGAGGGAGGCAGAAAAGCTTTCACAAGGCCTTGATCATCTTACAAAACAAGTCGACGAGTTTTTCGACATTGTTTTATCGGGACGTGATGCATTGCTTTCA
AATCTTAGAGCAAGTGAGACAGTATTTGATCAGGGAATGGGGGGGCTGTCTACAACAAGACAACTGTGA
mRNA sequenceShow/hide mRNA sequence
CTCCACTCCTCCGCCAGACGAATTCAACGTTTGCTGGCTGCAATAATTTCTTAGAAATTTCCTTTGCTTTCTGGTTTCTGAATCTGTTGGTGTTATATAATGAGCCGACC
ACAAGAGCCACACCGGCCTTTCTTTCCTTTCGGAAATCCTTTTCGTGCAATATCACCTAAGGGTTCCAAATTGTCCTCTAGACTTGTCTTTTTGTTAGCCGCTTTTGAGG
ATTCACTGGCAGAGAGGCTGAAAAAGCTTACTCCAAAATCAGATAATGATATAGTTAGCTTCTCATGGATGGAATTAGCAATGAAGCTGCTGTGCGAGACTCACAATGAT
GTTAAAACACTTATAGAAGACCTTGGTCTCCCTGTATCTGACTGGGAGGAGAAATGGTTAGATGAGTATTTGGGCATCAGTGTGAAATTACTTGATATATGCAATGATTT
TAGCTCTGAGCTCTCACAGTTGAATCAGGGTCAACTGGTACTTCGGTGTGCCTTGCACAATCTGGAATCTACATCTTCCAACCAGTTTGTTCAAGCTTGTTCTTCGTTAG
ATGGATGGAGTAAACATATTAGTTCCAGAACCTCCAGATTCGAGAGCTGTTGTTCTATTCTGGACAGTCTTGAGGAATCGCTCGATCCTCCGAAGGTGAAGAACTCATCC
AAAGGCAAGGTTCTGATGCATGTGATGTACGGAGTGAAGGTGGAAACTCTGTTTATTTGCAGTGTTTTTGCTTCTGTCTTCTCAGGTTCTTCCAAAAAGTTGTTTCCTAT
CACTGTTCCTGATACATATAGATGGGCACAAGCTTTTACTGACTTACAGAAAAATGTAAACGTGGGAATTAAGAAAATTTATTCTAGTGGACGATTTACTGTATTGAGAG
AGCTTGAAGCAGTTGATGAGAGTGTAACAAATTTGCATTCCATGATTCAAGGAAATATAGATGGTGGAATTAAAGCAGAAGAATTCCAGAATTTGGTTGTAGATTCGAGG
AGGGAGGCAGAAAAGCTTTCACAAGGCCTTGATCATCTTACAAAACAAGTCGACGAGTTTTTCGACATTGTTTTATCGGGACGTGATGCATTGCTTTCAAATCTTAGAGC
AAGTGAGACAGTATTTGATCAGGGAATGGGGGGGCTGTCTACAACAAGACAACTGTGATGGAGATGAGCTTGTTCCTTTATATAAAAGGGTTCAACTGGTTATTTTCCTA
TCTTTTGATTGGCGAATGTACACTATCTTGTCTGGCTTGTCATAGTAGTATAGCTTGTGTAGTAATGCTCTACGTTATCTTTACTATTTTTATGGTTTGATCATGTAATG
TTGTACTGTGCAATGAGAATGAAATGGAAAAGTATGTCATGTTCAAGTAAAAGGCTCACACCAGTTTAGCTTTCTTTGCCTTCCGCACTGTAATTTGTCTCAATAGTAAT
TCTTTCTGCTCAGTTCCACAAATTTTTTACCGTGTTAGTATCATATGCTAAACTGAATGATAAACACAAGTGTAAGGCTTTCTGTATTGAATTTTTTTTTGATGCAGTGT
GGATACGATGCTGTATGGAGTTGGAATAGGAGCTTACTCAAGTTGGTAAATTGTTACTGTTACTCAATCAGATTATTGATCCTCTGTAAAGTTTGCTTCCTAAACAATTG
ACAGATGGAAAAATCAGAAGCATGAGTTTCCAGTGCAGAATTCTTCAATTCCAATCTTATCATTGAACATTTGTGAGTTGGCTGTACAAATTGTCACCCTGTTTGTGAAT
TTTGCTCACGATCCTTCTCATATCTTAAAAGTCGTCCTTTTATTTTCTTTCCATTATACTAGTTTTTGGATAACTTTCTCAACTCATAGGTTCTGAAAGTAATGGTTCAA
TACAATACACTACTTGATGCCAGTGCTAGTTGATTTTGAGAGGAATTATTTCAATGGCGTTAAATGATTGCATACATTCTATAACAATAGAACTGTACTGTAAAAGCAAG
ATACACGGCGTACGGAACAGTTTTATAGATAGATTTTTATACTAACAGTGATAATACACTGACTTGATAATTGATATAGCTTTTGCTGGTGCACAGTAAGAAAGCCTCAA
ACATGACTCAATTGATACGAACATTTTTTCAACTCTACTGAATCGATTGGTAGAGTATCCTCTGGAAACTGACATGTTGTAGTCTGCTCCCCCTGATCGGGTAACAGACT
GCACGGCGGAGGAGTAACATGGCTTGTTATTCCCTCAATATCGGTGCAAGTCCATGGCTGTTTCTTGGCCTTCACTGCCATGCCGATTGTCACATTGGAAATGCATATGC
CTTTAAATGTGTCACCCGAAATTCCCTCCAATCTTGCTGCCATCGTTGCGTTTTCGACAACCATATCCCTGTAGTTGATTCCTTCTATTGCAGGCATTGCGTGGGGGTCA
TACTTCTTATCGGCATGCGACCCGTAATTTCCAGTCATCCAAAATGCCCATTTCATGGTATGCATTGTCATCCTCCTAACATATATGTCTTTAACATAGCCTCCCCTTCC
TATGCCAGTTTTGATCCTGACCCCTGATTCAGAATCGATGGCTACGATGTCTTCAGCTCGGACATCTTGGATCCCACCCGACATCTCACTTCCCAGTGCAATGACAGCAC
TTGTCGGGGAAATGCACGTGAGCCGTCTGATGATCAACTGTTTAGTTGGCCATCCGAATGCTATGCCATACTCGTCCCAACCACTCTTGACTGCCACACAATCATCTCCA
GATACTATGTAACAATCCTCAATTCTAACGTTTGTGCAGGAATCTGCAGATGAAATCAGTCATTAATTTTACAGGACGTATAAAACTTCATCTTCTCGAGTTGCTTCGAG
ATACAAACCTGGGTTGATACCATCTGTATTCGGGGATCGAACTGGAGCGATAATGGTGATTCCTTGAATAAGAACATTGCTGCATAACAGTTGCAACTGAATCAAACCAG
GAAGTGTAGAACTTCATATTATAGATTTTTCATAATTAACTCTAATTACCTGCTGTAAACAGGATGAACATTCCAAGTCGGAGCATTTAGCAGGGTCAGGTTCGATATCT
GGACATCGTTGGAATGCATGATTTCGATCAAGTACGGTCGGGTGTACTTAAGCTTGCCCTGATGAAAAAGTTGCCACCAGCGAGCGCCTTGACCGTCAATAGTGCCGTTG
TTCCCTGATATGAATCGGGTTTCGCTCAAATTTTCAGACACTCATCAAGAAGAATGACAGACAGATCAAGCTCCGGCAAGACATTGGGACACTTACCTGTAATGACAACA
TCAGTGAGGTTTGTTCCAAATAAGAGACTGATGTACCTTCCCCCCGACGTGTCCCTTCCATGACCGTAGGATGGCAATGGGTCGACCACAGGCCATTCGTTAGGATCCTA
TCAGAGTAGTTGAAACGCAGACAAATTTTAGATGAGATAATTTAGACAATTCAGCTTAGATGATTAATCTAGCCATGGTTCAATTGACAAGTCTAGTTGATTTATATGCA
AATGGGGAACATGGATCAAGGATTTCGGAGACATCAATTGAATTTTTTTAAATCTGAGGAACTCTTCTAATTTCATTGCACTTCCA
Protein sequenceShow/hide protein sequence
MSRPQEPHRPFFPFGNPFRAISPKGSKLSSRLVFLLAAFEDSLAERLKKLTPKSDNDIVSFSWMELAMKLLCETHNDVKTLIEDLGLPVSDWEEKWLDEYLGISVKLLDI
CNDFSSELSQLNQGQLVLRCALHNLESTSSNQFVQACSSLDGWSKHISSRTSRFESCCSILDSLEESLDPPKVKNSSKGKVLMHVMYGVKVETLFICSVFASVFSGSSKK
LFPITVPDTYRWAQAFTDLQKNVNVGIKKIYSSGRFTVLRELEAVDESVTNLHSMIQGNIDGGIKAEEFQNLVVDSRREAEKLSQGLDHLTKQVDEFFDIVLSGRDALLS
NLRASETVFDQGMGGLSTTRQL