; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017822 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017822
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionRING/U-box superfamily protein with ARM repeat domain
Genome locationtig00153056:363944..380155
RNA-Seq ExpressionSgr017822
SyntenySgr017822
Gene Ontology termsGO:0006122 - mitochondrial electron transport, ubiquinol to cytochrome c (biological process)
GO:0005750 - mitochondrial respiratory chain complex III (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003197 - Cytochrome b-c1 complex subunit 7
IPR009057 - Homeobox-like domain superfamily
IPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold
IPR036544 - Cytochrome b-c1 complex subunit 7 superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139027.1 uncharacterized protein LOC111010055 isoform X1 [Momordica charantia]1.3e-18780.76Show/hide
Query:  MIMAYKHSQRLFICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKSL
        MI+A+K SQRLFI RLNHLNNTR GAL  N MLYH AE+SSADQE LPSEWYE A+RKI+KLSCSLKNVDLIDGRLVNV DDSTI DER+EQRMRAFKSL
Subjt:  MIMAYKHSQRLFICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKSL

Query:  VRVFVGSPSSRRRAAE--MAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQS
        VRVFVGSPS+RRR  E  MA S+ TNCQP   F N SERE MVVDSLTK+SNFLNVSAQQRKLVRHTICP   QHHIWTGAL+ MLKEL++ELDPLA+QS
Subjt:  VRVFVGSPSSRRRAAE--MAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQS

Query:  P-NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTD-KSIGY
          NKGIKMGQQIVSSCLKFLDDA+NSNAHFTSWMRPAP Q VVD S SPRWED+LEMF+DL+ SLK EK LL +V KLEVMKEGLSQIKDVL+D KSIG+
Subjt:  P-NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTD-KSIGY

Query:  KEARHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGL
        KE++HQESLVQ+KLSKTLGHSSRCLFTLL++YL GH+RD+EVDFCGG+LK VEN+KF L MGR+LS DEEK+VWNGV+QLDRAMGVFKFVWETAGMKGGL
Subjt:  KEARHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGL

Query:  ELQGHLWCVGAEERQLNFKGN
        ELQGHLW VGA++RQL++KGN
Subjt:  ELQGHLWCVGAEERQLNFKGN

XP_022940414.1 uncharacterized protein LOC111446029 [Cucurbita moschata]5.9e-18880.19Show/hide
Query:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS
        MIMAYKHSQRL F+CR+ HLN TR  A   N MLYHC EDS  D E LP++WYEKAF KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DER+EQRMR FKS
Subjt:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS

Query:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP
        LVRVF+GSPS +RR  EMAASTATN QP+  FRN SERE MVVDSLTKVSNFLNVSAQQRKLVRHTICP   QHHIWTGAL+ +LKEL+MELDPLA+ SP
Subjt:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP

Query:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA
        NKGIKMGQQIVSSCL FL+DA+NSNAH TSWMRPAPLQ  VDSS SP+WED+LEMF DL+ +LKDEK L  YVTKLEVMKEGL+QI+DVLTDKSIG+KEA
Subjt:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA

Query:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVE-NDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLEL
        +HQESLVQKKLSKTLGHSSRCLFTLLLYYL GH RD+EVD CGGLLKAVE  +K+ +FMGR+LS DEE++VWNGVRQLDRAMG+FKFVWETAGMKG L L
Subjt:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVE-NDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLEL

Query:  QGHLWCVGAEERQLNFKGN
        QGHL+CVGAE+RQL++KGN
Subjt:  QGHLWCVGAEERQLNFKGN

XP_022982191.1 uncharacterized protein LOC111481093 [Cucurbita maxima]7.0e-18980.14Show/hide
Query:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS
        MIMAYKHSQRL F+CR+ HLN TRC AL  N MLYHC+EDS  DQE LP++WYEKAF KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DER+EQRMR FKS
Subjt:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS

Query:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP
        LVRVF+GS S +RR  EMAAST  N QP+A FRN SERE MVVDS TKVSNFLNVSAQQRKLVRHTICP   QHHIWTGAL+ +LKEL+MELDPLA+ SP
Subjt:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP

Query:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA
        NKGIKMGQQIVSSCLKFL+DA+NSNAH TSWMRPAPLQ  VDSS SP+WED+LEMF DL+ +LKDEK L  YVTKLEVMKEGL+QI+DVL DKSIG+KEA
Subjt:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA

Query:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQ
        +HQESLVQKKLSKTLGHSSRCLFTLLLYYL GH RD+EVD CGGLLKAVE +K+ +FMGR+LS DEE+ VWNGVRQLDRAMG+FKFVWETAGMKG L L+
Subjt:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQ

Query:  GHLWCVGAEERQLNFKGN
        GHL+CVGAE+RQL++KGN
Subjt:  GHLWCVGAEERQLNFKGN

XP_023524216.1 uncharacterized protein LOC111788189 [Cucurbita pepo subsp. pepo]5.0e-18779.71Show/hide
Query:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS
        MIMAYKHS RL F+CR+ HLN TRC AL  N MLYHC EDS  DQE LP++WYEKAF KIKKLS SLKNVDLIDGRLVNVNDDSTI+DER+EQRMR FKS
Subjt:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS

Query:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP
        LVRVF+GSPS +RR  EMAASTATN QP+A FRN SERE MVVDSLTKVSNFLNVSAQQRKLVRHTICP   QHHIWTGAL+ +LKEL+MELDP A+ SP
Subjt:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP

Query:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA
        N+GIKMGQQIVSSCL FL+DA+NSN H TSWMRPAPLQ  VDSS SP+WED+LEMF DL+ +LKDEK L  YVTKLEVMKEGL+QI+DVLTDKSIG+KEA
Subjt:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA

Query:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQ
        +HQESLVQKKLSKTLGHSSRCLFTLLLYYL GH RD+EVD CGGLLKA E +K+ +FMGR+LS DEE++VWNGVRQLDRAMG+FKFVWETAGMKG L LQ
Subjt:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQ

Query:  GHLWCVGAEE-RQLNFKGN
        GHL+CVGAE+ RQL++KGN
Subjt:  GHLWCVGAEE-RQLNFKGN

XP_038896888.1 uncharacterized protein LOC120085101 isoform X1 [Benincasa hispida]5.2e-19282.1Show/hide
Query:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS
        MIMAYK+ QRL FI RL HLNNTR GAL+ N+MLYHCAE SSADQE LPSEWYE AFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDE +EQRMR FKS
Subjt:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS

Query:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP
        LV V +GSP++RRR  EMA S++  CQP A FRN SERE M+VDSLTK+SNFLNVSAQQRKLVRHTICP   QHHIWTGAL+ MLKEL +EL PL+ QS 
Subjt:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP

Query:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQ-AVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKE
        NKGIKMG QIVSSCLKFLDDA+NSNAHFTSWMRPAPL+ AVVDSS  PRWED+LEMF DL+D LK+EKCL+HYVTKL+VMKEGLSQIKDVLTDKSIGYKE
Subjt:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQ-AVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKE

Query:  ARHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLEL
        A HQESLVQKKLSKTLGHSSRCLFTLLLYY+ GH RD+EVD CGGLLKA  NDKF LFMGRVLSSDEEKIVWNG+RQLDR MG+FKFVWETAGMKG LEL
Subjt:  ARHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLEL

Query:  QGHLWCVGAEERQLNFKGN
        QGHL+CVG E+RQL++KGN
Subjt:  QGHLWCVGAEERQLNFKGN

TrEMBL top hitse value%identityAlignment
A0A0A0LED2 Uncharacterized protein1.1e-17475.24Show/hide
Query:  MAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKSLV
        M Y   QRL FI RL HL NTRCGA + N+MLYH AEDSSA QE LPSEWYEKAF KIKKLSC L+NVDL+DGR+VN +DDSTI DER+EQ MR FKSLV
Subjt:  MAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKSLV

Query:  RVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSPNK
        R+ +GSPS++RR  E+A S++ NCQP A FRN SEREAMVVDSLTKV N L V+ QQRKLVRHTICP   QHHIWTGAL+Q+LKEL +EL PL+++S +K
Subjt:  RVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSPNK

Query:  GIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEARH
        GIKM  QIVSSCLKFLD A+NSN HF+SW+RPAP + VV SS  PRWED+LEMFNDL+  LKDEK L+HYVTKLEVMKEGLSQIKDV +D+SIG++EA+ 
Subjt:  GIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEARH

Query:  QESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQGH
        QESLVQKKLSKTLGHSSRCLFTLLLYYL GH RD+EVDFCGGLLK   NDKF LFMGRVLS DEEKIVWNGVRQLDRAMG+FK VWETAGMKG L L+GH
Subjt:  QESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQGH

Query:  LWCVGAEERQLNFKGN
        L+CVG E RQL++KGN
Subjt:  LWCVGAEERQLNFKGN

A0A1S3CR45 uncharacterized protein LOC1035038107.5e-17374.7Show/hide
Query:  MIMAYKHSQ-RLFICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS
        MIM Y H Q R FI RL HL +TRCGA + N+MLYH  E SS DQE LPSEWYEKAF KIKKLSC L+NVDL+DGR+VN +DDSTIIDER+EQ+MR FKS
Subjt:  MIMAYKHSQ-RLFICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS

Query:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP
        LVR+ +GSPS++RR  EMA S++ N Q  A FRN SEREAMVVDSLTK  NFL V+ QQRKL+RHTICP   QHHIWTGAL+Q+LKEL +EL PL+ +S 
Subjt:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP

Query:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTS-WMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKE
        NKGI M  QIVSSCLKFLDDA+NSN HFTS W+RPAP + +V+SS  PRWED+LEMFNDL+  LKDEK L+HYVTKLEVMKEGLSQIKDV +D+SIG+KE
Subjt:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTS-WMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKE

Query:  ARHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLEL
        A+ QESLVQKKLSKTLGHSSRCLFTLLLYYL GH RD+EVDFCGGLLK   NDKF LFMGRVLS DEEKIVWNGVRQLDRAMG+FK VWETAGMKG L L
Subjt:  ARHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLEL

Query:  QGHLWCVGAEERQLNFKGN
        QGHL+CV  E RQL++KGN
Subjt:  QGHLWCVGAEERQLNFKGN

A0A6J1CBU2 uncharacterized protein LOC111010055 isoform X16.4e-18880.76Show/hide
Query:  MIMAYKHSQRLFICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKSL
        MI+A+K SQRLFI RLNHLNNTR GAL  N MLYH AE+SSADQE LPSEWYE A+RKI+KLSCSLKNVDLIDGRLVNV DDSTI DER+EQRMRAFKSL
Subjt:  MIMAYKHSQRLFICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKSL

Query:  VRVFVGSPSSRRRAAE--MAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQS
        VRVFVGSPS+RRR  E  MA S+ TNCQP   F N SERE MVVDSLTK+SNFLNVSAQQRKLVRHTICP   QHHIWTGAL+ MLKEL++ELDPLA+QS
Subjt:  VRVFVGSPSSRRRAAE--MAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQS

Query:  P-NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTD-KSIGY
          NKGIKMGQQIVSSCLKFLDDA+NSNAHFTSWMRPAP Q VVD S SPRWED+LEMF+DL+ SLK EK LL +V KLEVMKEGLSQIKDVL+D KSIG+
Subjt:  P-NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTD-KSIGY

Query:  KEARHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGL
        KE++HQESLVQ+KLSKTLGHSSRCLFTLL++YL GH+RD+EVDFCGG+LK VEN+KF L MGR+LS DEEK+VWNGV+QLDRAMGVFKFVWETAGMKGGL
Subjt:  KEARHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGL

Query:  ELQGHLWCVGAEERQLNFKGN
        ELQGHLW VGA++RQL++KGN
Subjt:  ELQGHLWCVGAEERQLNFKGN

A0A6J1FK17 uncharacterized protein LOC1114460292.9e-18880.19Show/hide
Query:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS
        MIMAYKHSQRL F+CR+ HLN TR  A   N MLYHC EDS  D E LP++WYEKAF KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DER+EQRMR FKS
Subjt:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS

Query:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP
        LVRVF+GSPS +RR  EMAASTATN QP+  FRN SERE MVVDSLTKVSNFLNVSAQQRKLVRHTICP   QHHIWTGAL+ +LKEL+MELDPLA+ SP
Subjt:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP

Query:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA
        NKGIKMGQQIVSSCL FL+DA+NSNAH TSWMRPAPLQ  VDSS SP+WED+LEMF DL+ +LKDEK L  YVTKLEVMKEGL+QI+DVLTDKSIG+KEA
Subjt:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA

Query:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVE-NDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLEL
        +HQESLVQKKLSKTLGHSSRCLFTLLLYYL GH RD+EVD CGGLLKAVE  +K+ +FMGR+LS DEE++VWNGVRQLDRAMG+FKFVWETAGMKG L L
Subjt:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVE-NDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLEL

Query:  QGHLWCVGAEERQLNFKGN
        QGHL+CVGAE+RQL++KGN
Subjt:  QGHLWCVGAEERQLNFKGN

A0A6J1IW03 uncharacterized protein LOC1114810933.4e-18980.14Show/hide
Query:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS
        MIMAYKHSQRL F+CR+ HLN TRC AL  N MLYHC+EDS  DQE LP++WYEKAF KIKKLSCSLKNVDLIDGRLVNVNDDSTI+DER+EQRMR FKS
Subjt:  MIMAYKHSQRL-FICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKS

Query:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP
        LVRVF+GS S +RR  EMAAST  N QP+A FRN SERE MVVDS TKVSNFLNVSAQQRKLVRHTICP   QHHIWTGAL+ +LKEL+MELDPLA+ SP
Subjt:  LVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICP---QHHIWTGALEQMLKELRMELDPLAYQSP

Query:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA
        NKGIKMGQQIVSSCLKFL+DA+NSNAH TSWMRPAPLQ  VDSS SP+WED+LEMF DL+ +LKDEK L  YVTKLEVMKEGL+QI+DVL DKSIG+KEA
Subjt:  NKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEA

Query:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQ
        +HQESLVQKKLSKTLGHSSRCLFTLLLYYL GH RD+EVD CGGLLKAVE +K+ +FMGR+LS DEE+ VWNGVRQLDRAMG+FKFVWETAGMKG L L+
Subjt:  RHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQ

Query:  GHLWCVGAEERQLNFKGN
        GHL+CVGAE+RQL++KGN
Subjt:  GHLWCVGAEERQLNFKGN

SwissProt top hitse value%identityAlignment
F4JWS8 Cytochrome b-c1 complex subunit 7-2, mitochondrial1.1e-3072.16Show/hide
Query:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLP-------RISRSYLQDMLALEK
        +  +DP+KN+ AR HMK++S RLR YGLRYDDLYDP YDLD+KEALNRLPREIVDARNQRL RAMDLSMKH+YLP          RSYLQDMLAL K
Subjt:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLP-------RISRSYLQDMLALEK

P48502 Cytochrome b-c1 complex subunit 75.8e-2974.47Show/hide
Query:  IDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLPR-------ISRSYLQDMLALEK
        +DPKKN  A  HMK LS RLRNYGLR+DDLYDP YDLDVKEALNRLPREIVDARNQRL RAMDLSMKH+YLP          R+YLQ+MLAL K
Subjt:  IDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLPR-------ISRSYLQDMLALEK

Q8H0V5 Protein OVEREXPRESSOR OF CATIONIC PEROXIDASE 37.5e-6147.06Show/hide
Query:  VKLRRFGESPKVYHL-PSRYRLNTG-SVDRVSYGFLGAGEGTGIASSLPRRPPLRLTLSLTFARRRNQNSGVNPSPSSSKK-KKRNLSPKEARDKEDDEE
        +K      +  V HL P  +  ++G SV+RV +    A   +    SLP   P R    L FAR +N+   V+ S SS KK KK++L   +    E++E+
Subjt:  VKLRRFGESPKVYHL-PSRYRLNTG-SVDRVSYGFLGAGEGTGIASSLPRRPPLRLTLSLTFARRRNQNSGVNPSPSSSKK-KKRNLSPKEARDKEDDEE

Query:  DVDE--DALRHCLVYWKKISRMMSEEDLSRLERELGLALGI----------------------NDDDDQEEEAEEEDLEDNEEAEMPVKLKNWQLRRLAS
          +   + L   L         +SEE+L  L  EL  ALG+                      NDDDD +++  ++D +D+EE E P KLKNWQL+RLA 
Subjt:  DVDE--DALRHCLVYWKKISRMMSEEDLSRLERELGLALGI----------------------NDDDDQEEEAEEEDLEDNEEAEMPVKLKNWQLRRLAS

Query:  ALKKGRRKTSIKSLAAELCLDRAIVLDLLREPPPNLLMLSASLPD-----------TPTPSVPETKIIQTTDEEPIGDTAEEAKVPVHVMQQRWTAQKRL
        ALK GRRKTSIK+LAAE+CLDRA VL+LLR+PPP LLMLSA+LPD           +P PS  E+   +    EP     +EA   VHVMQQRW+AQKR+
Subjt:  ALKKGRRKTSIKSLAAELCLDRAIVLDLLREPPPNLLMLSASLPD-----------TPTPSVPETKIIQTTDEEPIGDTAEEAKVPVHVMQQRWTAQKRL

Query:  KKVQVETLERVYRRTKRPTNAMISSIVQVTNLPRKRIVKWFEDKRAEDGVPDQRLPY
        KK  +ETLE+VYRR+KRPTNA++SSIVQVTNLPRKR++KWFEDKRAEDGVPD+R PY
Subjt:  KKVQVETLERVYRRTKRPTNAMISSIVQVTNLPRKRIVKWFEDKRAEDGVPDQRLPY

Q9SUU5 Cytochrome b-c1 complex subunit 7-1, mitochondrial2.8e-3172.63Show/hide
Query:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLPR-------ISRSYLQDMLAL
        ++FIDPKKN+ AR HMKA+S RLR YGLRYDDLYD YY +D+KEA+NRLPRE+VDARNQRLKRAMDLSMKH+YLP+         R YLQDMLAL
Subjt:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLPR-------ISRSYLQDMLAL

Q9ZV31 U-box domain-containing protein 121.8e-0928.79Show/hide
Query:  GSPETVKLSSSLICSLAMLDKNKAKFGVAGTIQLLVRALSIPSVPAAHHLLTSLAELGQFHGNCTLAVRSGAIPVLINVV-ESTSGEDLAGTALAVLGLL
        GS E  + +++ + SL+++D+NK   G AG I  LV  LS  S        T+L  L  F GN   AVR+G +PVL+ ++ E  SG  +   +L++L +L
Subjt:  GSPETVKLSSSLICSLAMLDKNKAKFGVAGTIQLLVRALSIPSVPAAHHLLTSLAELGQFHGNCTLAVRSGAIPVLINVV-ESTSGEDLAGTALAVLGLL

Query:  ARFEEGLRALIKTDRIVNSMVNVLKGRCLLSKEGATEILLRLFDESEGCLRDALRLPEFLGVVADLSVRGSAKAREKAALLMNKIMNSDFDTYSKADS
        +   +G   +   D  V  +V+ ++     +KE +  +L+ L   ++  L +A +L   + ++ +++  G+ + + KAA L+N+   S F+   K  S
Subjt:  ARFEEGLRALIKTDRIVNSMVNVLKGRCLLSKEGATEILLRLFDESEGCLRDALRLPEFLGVVADLSVRGSAKAREKAALLMNKIMNSDFDTYSKADS

Arabidopsis top hitse value%identityAlignment
AT4G32470.1 Cytochrome bd ubiquinol oxidase, 14kDa subunit2.0e-3272.63Show/hide
Query:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLPR-------ISRSYLQDMLAL
        ++FIDPKKN+ AR HMKA+S RLR YGLRYDDLYD YY +D+KEA+NRLPRE+VDARNQRLKRAMDLSMKH+YLP+         R YLQDMLAL
Subjt:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLPR-------ISRSYLQDMLAL

AT4G32470.2 Cytochrome bd ubiquinol oxidase, 14kDa subunit2.0e-3272.63Show/hide
Query:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLPR-------ISRSYLQDMLAL
        ++FIDPKKN+ AR HMKA+S RLR YGLRYDDLYD YY +D+KEA+NRLPRE+VDARNQRLKRAMDLSMKH+YLP+         R YLQDMLAL
Subjt:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLPR-------ISRSYLQDMLAL

AT5G11270.1 overexpressor of cationic peroxidase 35.3e-6247.06Show/hide
Query:  VKLRRFGESPKVYHL-PSRYRLNTG-SVDRVSYGFLGAGEGTGIASSLPRRPPLRLTLSLTFARRRNQNSGVNPSPSSSKK-KKRNLSPKEARDKEDDEE
        +K      +  V HL P  +  ++G SV+RV +    A   +    SLP   P R    L FAR +N+   V+ S SS KK KK++L   +    E++E+
Subjt:  VKLRRFGESPKVYHL-PSRYRLNTG-SVDRVSYGFLGAGEGTGIASSLPRRPPLRLTLSLTFARRRNQNSGVNPSPSSSKK-KKRNLSPKEARDKEDDEE

Query:  DVDE--DALRHCLVYWKKISRMMSEEDLSRLERELGLALGI----------------------NDDDDQEEEAEEEDLEDNEEAEMPVKLKNWQLRRLAS
          +   + L   L         +SEE+L  L  EL  ALG+                      NDDDD +++  ++D +D+EE E P KLKNWQL+RLA 
Subjt:  DVDE--DALRHCLVYWKKISRMMSEEDLSRLERELGLALGI----------------------NDDDDQEEEAEEEDLEDNEEAEMPVKLKNWQLRRLAS

Query:  ALKKGRRKTSIKSLAAELCLDRAIVLDLLREPPPNLLMLSASLPD-----------TPTPSVPETKIIQTTDEEPIGDTAEEAKVPVHVMQQRWTAQKRL
        ALK GRRKTSIK+LAAE+CLDRA VL+LLR+PPP LLMLSA+LPD           +P PS  E+   +    EP     +EA   VHVMQQRW+AQKR+
Subjt:  ALKKGRRKTSIKSLAAELCLDRAIVLDLLREPPPNLLMLSASLPD-----------TPTPSVPETKIIQTTDEEPIGDTAEEAKVPVHVMQQRWTAQKRL

Query:  KKVQVETLERVYRRTKRPTNAMISSIVQVTNLPRKRIVKWFEDKRAEDGVPDQRLPY
        KK  +ETLE+VYRR+KRPTNA++SSIVQVTNLPRKR++KWFEDKRAEDGVPD+R PY
Subjt:  KKVQVETLERVYRRTKRPTNAMISSIVQVTNLPRKRIVKWFEDKRAEDGVPDQRLPY

AT5G25450.1 Cytochrome bd ubiquinol oxidase, 14kDa subunit7.5e-3272.16Show/hide
Query:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLP-------RISRSYLQDMLALEK
        +  +DP+KN+ AR HMK++S RLR YGLRYDDLYDP YDLD+KEALNRLPREIVDARNQRL RAMDLSMKH+YLP          RSYLQDMLAL K
Subjt:  ESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLP-------RISRSYLQDMLALEK

AT5G25500.1 unknown protein2.5e-9648.86Show/hide
Query:  MLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKSLVRVFVGSPSSRRRAAEMAASTATNCQPRARF
        +LYH + DS  D   LP EWYE     +KKL+ +L++VDL+DG+L ++N    + D+ + ++M+AFKSL R+F+GSPS +++  E             RF
Subjt:  MLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDLIDGRLVNVNDDSTIIDERVEQRMRAFKSLVRVFVGSPSSRRRAAEMAASTATNCQPRARF

Query:  RNP-----SEREAMVVDSLTKVSNFLNVSAQQRKLVRHTIC---PQHHIWTGALEQMLKELRMELDPLA-YQSPNKGIKMGQQIVSSCLKFLDDASNS--
        + P     SERE +VV+SLTKV NFLNVSAQQRKLVR T+C    Q+ IW G LE +L  L+ E+D L  ++  ++G  + QQ++ SCL+FL ++S S  
Subjt:  RNP-----SEREAMVVDSLTKVSNFLNVSAQQRKLVRHTIC---PQHHIWTGALEQMLKELRMELDPLA-YQSPNKGIKMGQQIVSSCLKFLDDASNS--

Query:  NAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLK--DEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEARHQESLVQKKLSKTLGHSSRCL
            TSWMRP P +    ++ S +WED+L+M NDL   L+  +E  +L+++ KL  MKEGL QIKDV  D +IG++E RHQE LV +KLSK LG  S CL
Subjt:  NAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLK--DEKCLLHYVTKLEVMKEGLSQIKDVLTDKSIGYKEARHQESLVQKKLSKTLGHSSRCL

Query:  FTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQGHLWCVGAEERQLNFKG
        F L++Y+L G VRD+EVD CGG  K  +++  CL MGR+L+S +EK++  G++QLDRA+G+F+FVWETAGMK  L LQGHLWC+GAEER + ++G
Subjt:  FTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQGHLWCVGAEERQLNFKG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAAGAGATCAAATGGTTGTTCTAGCTCTTACAGCGGCACCAGTGACGACTATGTAGAGATCTGTTGGTGCTTCATTCTGCAATATGATGTCTTCTCTGGGAGGGA
AGTACTCGGCCTTCATTTCTGTCACCTTCCAAGTATGAAGGGTAATTTTGAAATAATCAAGAGGCTGGATATAACAAAATTGAAAGTGATGCTGAACGTGAAAGCATGCG
CAAGAAATGAGCTTCAAACTTACAAATTTTCTGGTTCGACTGGTGCCATGGACGACCAAGTTAGTCATATTTCCAATCAGGTAAGCTTGCAATCCCAGAATAAAAAACAG
GTAACAGATGACAAAGGCCATCTCTTGATCATTTACAGGATGCAAATCGCCATAGCCAACGGATGTGATCGTCGATTTCTGATATCTGTGCCAATTTTCTTACCTGGAGA
ACATGGAACTGACTCGCCGGAGACGCCAGAGACGAAGCATATTAAAGTACCCATAAGTCTGAAGAGGAGGTGGGAGAATACTCCGGGCGAGTTCCGACGGAATAATAGAG
ACGACGTCGAAGACCAACCAGGTTTTGGCATATCGCAGAGCAATCAGCTTAGGCTCGTCGACGAGCAGATAAGAAGTTTTGTCCAAATATGCGACGAAGAAAGTCAAAAT
TATATCAATGGCGAAAATCGCATTTACAACATTGTCAATTATAGCAAGAGGTCCAGAAGGATCCTCCAGAAATCCAAACTCAAACGGGCAAACCCATGCCGTGTAGAGCA
CAGAAAAATCAGAAATGTCTCCCAAAGTCTGCAGCATCAAATCATCAAAATAGGAGGAAGGATTCCGCCGGTGAGGCTGTATTGGCTGCCGTCGTCTCTGGACAATTGCT
CCATATCCTGCTCCGGCTCCCGCCCGCACATTGATCTTCCTAACAATCCCCTCTTCTTCTTGTCTTCAATTCACAATCTCTCCCTCCATTGCCATCTCCTTAGACCCGCT
TCAATTCCAGCAAATCTCTCTCTCGACGAGCGCGCTGCCAAAATCTTGCGCTGCATTCAAGCTTCTTCTAAATTTGGATTCGCCGGAACAGAAAGAGAAAGGAAAGGGGC
CATCGCCGTGGCCGTCGCTCTCCTCTCGTTCTGGCTCCCCCATTATAACCACCGCCTGAATCTGTTGTTGGAAGTTATGGAAGTGACATTCCAATTTGGATTTGACGTTG
CATTTAACATGCATGCCTCTGTAGAATACTCTAATTTCGGAGTCGTCGTCAAAATCTGCAAGTTCACAATTAGGCCAGGCGATTCTTCCAGAAAACCGGAGATTGGCGAA
GCAGAATCGTTCATAGACCCGAAGAAGAATTGGTTCGCTCGTCAACACATGAAAGCGCTCTCCCAGCGCCTCCGTAATTATGGTCTTCGATACGACGATCTGTACGATCC
GTACTACGATCTTGATGTGAAGGAGGCTTTGAATCGACTCCCGAGGGAGATTGTGGATGCGCGCAATCAGCGTCTGAAGCGGGCCATGGACCTTTCCATGAAGCATAAGT
ACCTTCCGAGAATCTCCAGGAGTTATCTCCAGGATATGCTGGCACTTGAGAAAGCAGTGCTAATGAAGCTTAAAATCCTTTTTTGTTCTGCTTATGTTTTACAGGTAAAG
AAGGAGAGAGCAGAACTTTATAGTGGGATCAAGAATTCATTCAGTCTACGCTTTTCCGGCGACGTGGAATTTCCGATTCCCCAAGTGGACTCGGCGAGTGAGAACCGAAA
ATGTGAGGTCGGTGTCTCCTGGCAAGATCTGACTGCTGTTGATGGCAGAGATGGCATCGTCCTCAGGAAGGAACAAACTCTGGCCTCTTCATCATCCTTCTTCATGTCTG
TTTCATCTTTCCATTCATCATCATCCATGAGGTCCATGACTGTTGCAACTGCACATAAAGCTATAACACAGTGTGTAGCTGATGCTCGGTCGGACGCCCATGAAGTTCAG
GAAAAGGCTCTTCAAAACTTGGTTTTCATTACTCAGCTTCTTGCATCAATGGAAACCATCTACCATCTCAACACGCTCGTGTCCTTGGGCTCCCCCGAAACCGTCAAGCT
GTCGTCGTCTTTGATCTGTAGCCTTGCAATGCTAGACAAGAACAAGGCAAAGTTTGGGGTAGCAGGGACCATACAGTTATTGGTTAGAGCACTTTCAATCCCTAGTGTTC
CTGCTGCTCATCACCTCCTCACTTCTTTAGCTGAACTAGGCCAGTTTCATGGAAACTGCACTTTGGCAGTCCGATCAGGAGCCATACCGGTTCTCATCAACGTTGTAGAA
AGTACTAGCGGAGAGGATCTCGCAGGCACTGCTCTTGCTGTTCTTGGTCTCTTGGCTAGATTTGAGGAGGGGTTGAGGGCTTTGATAAAAACTGATCGGATTGTTAATTC
AATGGTTAATGTGCTGAAAGGAAGGTGTTTGTTGAGTAAAGAAGGTGCAACCGAGATCCTTTTGCGATTGTTCGACGAAAGTGAAGGTTGTCTGAGAGATGCTTTGAGGT
TGCCGGAGTTTTTGGGTGTTGTTGCTGATCTTTCTGTCAGAGGATCTGCAAAAGCTAGAGAGAAAGCTGCTTTGCTTATGAATAAGATCATGAATAGTGACTTTGATACA
TATTCAAAAGCAGATTCAGTGTATTCACAATGTCTGGTTAGACCAGGATTTCAACAGCGTTTGATTTTTCGTCTTCATTTCTTTAATGGTGTGCCCTCCATGTGTTCGAC
AAAATGTATGATTATGGCGTATAAGCATTCTCAGAGGTTGTTCATCTGTCGCCTAAACCATCTTAACAACACGAGATGTGGAGCTTTGCGTTTAAATGCGATGCTATATC
ACTGCGCAGAGGACTCCTCTGCCGATCAAGAGCCGTTACCCTCTGAATGGTACGAGAAGGCGTTTCGGAAGATAAAGAAACTGAGCTGCTCGCTGAAGAATGTGGATCTG
ATCGATGGACGCCTTGTTAATGTTAACGATGATTCAACCATTATCGACGAGCGTGTTGAACAGAGAATGCGTGCTTTCAAGTCCCTTGTAAGAGTGTTCGTTGGTTCTCC
ATCATCTCGGAGGAGAGCAGCAGAAATGGCTGCATCGACTGCTACAAATTGCCAGCCACGCGCACGCTTCAGAAATCCAAGTGAAAGAGAGGCAATGGTTGTTGATTCAC
TCACCAAAGTTAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCACACCATATGCCCACAGCATCACATTTGGACTGGTGCATTGGAGCAAATGCTG
AAAGAGTTGAGAATGGAGTTGGATCCACTGGCTTATCAATCACCCAACAAAGGGATCAAAATGGGGCAGCAAATAGTTTCAAGTTGCCTGAAGTTTTTGGATGATGCCAG
CAATTCAAACGCTCACTTCACTTCATGGATGCGGCCAGCACCGTTACAAGCCGTTGTTGATTCATCTGTGTCGCCAAGATGGGAAGACATTCTCGAGATGTTCAACGATC
TGGTCGACTCTCTGAAAGACGAAAAGTGTTTGCTCCATTATGTGACAAAGCTTGAGGTGATGAAAGAGGGGCTTTCTCAAATCAAAGATGTGCTGACTGATAAAAGCATT
GGGTACAAGGAAGCCAGGCACCAAGAGAGTCTGGTGCAGAAGAAGCTTTCAAAGACACTGGGCCACTCATCCAGGTGCTTGTTCACTCTTTTACTATACTATCTTTGTGG
GCATGTTAGGGATCTTGAGGTGGATTTTTGTGGTGGGCTGTTGAAGGCTGTTGAGAATGACAAGTTTTGCTTGTTCATGGGGAGGGTTTTGAGCTCTGATGAGGAGAAAA
TCGTTTGGAATGGGGTGAGGCAGCTTGATAGAGCAATGGGGGTTTTTAAATTTGTTTGGGAAACAGCTGGAATGAAGGGAGGATTGGAATTGCAAGGCCATTTATGGTGT
GTTGGGGCTGAGGAAAGGCAGCTTAATTTTAAAGGAAATCCCGCTTCCAAACAAGAACTGCCGCATGGGGCTACAGTAAAACTGCGTCGTTTTGGAGAGAGTCCGAAGGT
TTATCATCTGCCGTCTCGATATCGGTTGAACACGGGATCGGTCGACAGAGTGAGCTATGGTTTCCTCGGCGCCGGTGAAGGCACCGGAATCGCTTCCAGTCTTCCGCGTC
GCCCTCCGCTTCGTCTCACTCTGTCTCTTACATTTGCCCGCCGTCGGAACCAGAATTCAGGAGTCAATCCGTCTCCATCGTCTTCGAAGAAAAAGAAGAGAAATTTATCT
CCTAAAGAAGCTAGAGACAAGGAGGACGACGAGGAGGATGTAGATGAGGATGCTTTGAGGCATTGTTTAGTCTATTGGAAGAAGATCTCAAGAATGATGAGCGAAGAGGA
CCTTTCCAGACTTGAACGTGAGCTAGGGTTAGCACTTGGGATCAATGATGATGATGATCAAGAAGAAGAAGCAGAAGAAGAAGATCTCGAGGATAACGAAGAAGCGGAAA
TGCCTGTAAAACTTAAGAACTGGCAACTTCGACGACTAGCCTCGGCTTTGAAAAAGGGCCGCCGTAAAACTAGCATTAAGAGTCTTGCTGCTGAGCTTTGTCTTGATAGG
GCTATCGTACTTGATTTGCTTCGTGAACCACCACCAAATCTTCTGATGTTGAGTGCTAGTCTACCAGACACTCCTACACCATCTGTTCCAGAAACTAAAATTATACAAAC
TACTGATGAAGAACCCATAGGAGATACTGCAGAAGAGGCGAAGGTGCCTGTTCATGTCATGCAACAGAGGTGGACTGCTCAAAAGAGACTGAAGAAGGTTCAAGTTGAAA
CTCTGGAAAGAGTATATAGAAGAACAAAGCGGCCCACTAATGCGATGATTAGTAGCATCGTCCAAGTGACAAATCTGCCTCGCAAGAGAATAGTGAAATGGTTTGAAGAC
AAGCGAGCTGAAGATGGGGTTCCTGATCAACGACTGCCTTATGCTCGGTCTGCTCCTAAATCTGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATAAGAGATCAAATGGTTGTTCTAGCTCTTACAGCGGCACCAGTGACGACTATGTAGAGATCTGTTGGTGCTTCATTCTGCAATATGATGTCTTCTCTGGGAGGGA
AGTACTCGGCCTTCATTTCTGTCACCTTCCAAGTATGAAGGGTAATTTTGAAATAATCAAGAGGCTGGATATAACAAAATTGAAAGTGATGCTGAACGTGAAAGCATGCG
CAAGAAATGAGCTTCAAACTTACAAATTTTCTGGTTCGACTGGTGCCATGGACGACCAAGTTAGTCATATTTCCAATCAGGTAAGCTTGCAATCCCAGAATAAAAAACAG
GTAACAGATGACAAAGGCCATCTCTTGATCATTTACAGGATGCAAATCGCCATAGCCAACGGATGTGATCGTCGATTTCTGATATCTGTGCCAATTTTCTTACCTGGAGA
ACATGGAACTGACTCGCCGGAGACGCCAGAGACGAAGCATATTAAAGTACCCATAAGTCTGAAGAGGAGGTGGGAGAATACTCCGGGCGAGTTCCGACGGAATAATAGAG
ACGACGTCGAAGACCAACCAGGTTTTGGCATATCGCAGAGCAATCAGCTTAGGCTCGTCGACGAGCAGATAAGAAGTTTTGTCCAAATATGCGACGAAGAAAGTCAAAAT
TATATCAATGGCGAAAATCGCATTTACAACATTGTCAATTATAGCAAGAGGTCCAGAAGGATCCTCCAGAAATCCAAACTCAAACGGGCAAACCCATGCCGTGTAGAGCA
CAGAAAAATCAGAAATGTCTCCCAAAGTCTGCAGCATCAAATCATCAAAATAGGAGGAAGGATTCCGCCGGTGAGGCTGTATTGGCTGCCGTCGTCTCTGGACAATTGCT
CCATATCCTGCTCCGGCTCCCGCCCGCACATTGATCTTCCTAACAATCCCCTCTTCTTCTTGTCTTCAATTCACAATCTCTCCCTCCATTGCCATCTCCTTAGACCCGCT
TCAATTCCAGCAAATCTCTCTCTCGACGAGCGCGCTGCCAAAATCTTGCGCTGCATTCAAGCTTCTTCTAAATTTGGATTCGCCGGAACAGAAAGAGAAAGGAAAGGGGC
CATCGCCGTGGCCGTCGCTCTCCTCTCGTTCTGGCTCCCCCATTATAACCACCGCCTGAATCTGTTGTTGGAAGTTATGGAAGTGACATTCCAATTTGGATTTGACGTTG
CATTTAACATGCATGCCTCTGTAGAATACTCTAATTTCGGAGTCGTCGTCAAAATCTGCAAGTTCACAATTAGGCCAGGCGATTCTTCCAGAAAACCGGAGATTGGCGAA
GCAGAATCGTTCATAGACCCGAAGAAGAATTGGTTCGCTCGTCAACACATGAAAGCGCTCTCCCAGCGCCTCCGTAATTATGGTCTTCGATACGACGATCTGTACGATCC
GTACTACGATCTTGATGTGAAGGAGGCTTTGAATCGACTCCCGAGGGAGATTGTGGATGCGCGCAATCAGCGTCTGAAGCGGGCCATGGACCTTTCCATGAAGCATAAGT
ACCTTCCGAGAATCTCCAGGAGTTATCTCCAGGATATGCTGGCACTTGAGAAAGCAGTGCTAATGAAGCTTAAAATCCTTTTTTGTTCTGCTTATGTTTTACAGGTAAAG
AAGGAGAGAGCAGAACTTTATAGTGGGATCAAGAATTCATTCAGTCTACGCTTTTCCGGCGACGTGGAATTTCCGATTCCCCAAGTGGACTCGGCGAGTGAGAACCGAAA
ATGTGAGGTCGGTGTCTCCTGGCAAGATCTGACTGCTGTTGATGGCAGAGATGGCATCGTCCTCAGGAAGGAACAAACTCTGGCCTCTTCATCATCCTTCTTCATGTCTG
TTTCATCTTTCCATTCATCATCATCCATGAGGTCCATGACTGTTGCAACTGCACATAAAGCTATAACACAGTGTGTAGCTGATGCTCGGTCGGACGCCCATGAAGTTCAG
GAAAAGGCTCTTCAAAACTTGGTTTTCATTACTCAGCTTCTTGCATCAATGGAAACCATCTACCATCTCAACACGCTCGTGTCCTTGGGCTCCCCCGAAACCGTCAAGCT
GTCGTCGTCTTTGATCTGTAGCCTTGCAATGCTAGACAAGAACAAGGCAAAGTTTGGGGTAGCAGGGACCATACAGTTATTGGTTAGAGCACTTTCAATCCCTAGTGTTC
CTGCTGCTCATCACCTCCTCACTTCTTTAGCTGAACTAGGCCAGTTTCATGGAAACTGCACTTTGGCAGTCCGATCAGGAGCCATACCGGTTCTCATCAACGTTGTAGAA
AGTACTAGCGGAGAGGATCTCGCAGGCACTGCTCTTGCTGTTCTTGGTCTCTTGGCTAGATTTGAGGAGGGGTTGAGGGCTTTGATAAAAACTGATCGGATTGTTAATTC
AATGGTTAATGTGCTGAAAGGAAGGTGTTTGTTGAGTAAAGAAGGTGCAACCGAGATCCTTTTGCGATTGTTCGACGAAAGTGAAGGTTGTCTGAGAGATGCTTTGAGGT
TGCCGGAGTTTTTGGGTGTTGTTGCTGATCTTTCTGTCAGAGGATCTGCAAAAGCTAGAGAGAAAGCTGCTTTGCTTATGAATAAGATCATGAATAGTGACTTTGATACA
TATTCAAAAGCAGATTCAGTGTATTCACAATGTCTGGTTAGACCAGGATTTCAACAGCGTTTGATTTTTCGTCTTCATTTCTTTAATGGTGTGCCCTCCATGTGTTCGAC
AAAATGTATGATTATGGCGTATAAGCATTCTCAGAGGTTGTTCATCTGTCGCCTAAACCATCTTAACAACACGAGATGTGGAGCTTTGCGTTTAAATGCGATGCTATATC
ACTGCGCAGAGGACTCCTCTGCCGATCAAGAGCCGTTACCCTCTGAATGGTACGAGAAGGCGTTTCGGAAGATAAAGAAACTGAGCTGCTCGCTGAAGAATGTGGATCTG
ATCGATGGACGCCTTGTTAATGTTAACGATGATTCAACCATTATCGACGAGCGTGTTGAACAGAGAATGCGTGCTTTCAAGTCCCTTGTAAGAGTGTTCGTTGGTTCTCC
ATCATCTCGGAGGAGAGCAGCAGAAATGGCTGCATCGACTGCTACAAATTGCCAGCCACGCGCACGCTTCAGAAATCCAAGTGAAAGAGAGGCAATGGTTGTTGATTCAC
TCACCAAAGTTAGCAACTTCCTCAACGTCTCTGCCCAACAAAGGAAACTGGTGCGCCACACCATATGCCCACAGCATCACATTTGGACTGGTGCATTGGAGCAAATGCTG
AAAGAGTTGAGAATGGAGTTGGATCCACTGGCTTATCAATCACCCAACAAAGGGATCAAAATGGGGCAGCAAATAGTTTCAAGTTGCCTGAAGTTTTTGGATGATGCCAG
CAATTCAAACGCTCACTTCACTTCATGGATGCGGCCAGCACCGTTACAAGCCGTTGTTGATTCATCTGTGTCGCCAAGATGGGAAGACATTCTCGAGATGTTCAACGATC
TGGTCGACTCTCTGAAAGACGAAAAGTGTTTGCTCCATTATGTGACAAAGCTTGAGGTGATGAAAGAGGGGCTTTCTCAAATCAAAGATGTGCTGACTGATAAAAGCATT
GGGTACAAGGAAGCCAGGCACCAAGAGAGTCTGGTGCAGAAGAAGCTTTCAAAGACACTGGGCCACTCATCCAGGTGCTTGTTCACTCTTTTACTATACTATCTTTGTGG
GCATGTTAGGGATCTTGAGGTGGATTTTTGTGGTGGGCTGTTGAAGGCTGTTGAGAATGACAAGTTTTGCTTGTTCATGGGGAGGGTTTTGAGCTCTGATGAGGAGAAAA
TCGTTTGGAATGGGGTGAGGCAGCTTGATAGAGCAATGGGGGTTTTTAAATTTGTTTGGGAAACAGCTGGAATGAAGGGAGGATTGGAATTGCAAGGCCATTTATGGTGT
GTTGGGGCTGAGGAAAGGCAGCTTAATTTTAAAGGAAATCCCGCTTCCAAACAAGAACTGCCGCATGGGGCTACAGTAAAACTGCGTCGTTTTGGAGAGAGTCCGAAGGT
TTATCATCTGCCGTCTCGATATCGGTTGAACACGGGATCGGTCGACAGAGTGAGCTATGGTTTCCTCGGCGCCGGTGAAGGCACCGGAATCGCTTCCAGTCTTCCGCGTC
GCCCTCCGCTTCGTCTCACTCTGTCTCTTACATTTGCCCGCCGTCGGAACCAGAATTCAGGAGTCAATCCGTCTCCATCGTCTTCGAAGAAAAAGAAGAGAAATTTATCT
CCTAAAGAAGCTAGAGACAAGGAGGACGACGAGGAGGATGTAGATGAGGATGCTTTGAGGCATTGTTTAGTCTATTGGAAGAAGATCTCAAGAATGATGAGCGAAGAGGA
CCTTTCCAGACTTGAACGTGAGCTAGGGTTAGCACTTGGGATCAATGATGATGATGATCAAGAAGAAGAAGCAGAAGAAGAAGATCTCGAGGATAACGAAGAAGCGGAAA
TGCCTGTAAAACTTAAGAACTGGCAACTTCGACGACTAGCCTCGGCTTTGAAAAAGGGCCGCCGTAAAACTAGCATTAAGAGTCTTGCTGCTGAGCTTTGTCTTGATAGG
GCTATCGTACTTGATTTGCTTCGTGAACCACCACCAAATCTTCTGATGTTGAGTGCTAGTCTACCAGACACTCCTACACCATCTGTTCCAGAAACTAAAATTATACAAAC
TACTGATGAAGAACCCATAGGAGATACTGCAGAAGAGGCGAAGGTGCCTGTTCATGTCATGCAACAGAGGTGGACTGCTCAAAAGAGACTGAAGAAGGTTCAAGTTGAAA
CTCTGGAAAGAGTATATAGAAGAACAAAGCGGCCCACTAATGCGATGATTAGTAGCATCGTCCAAGTGACAAATCTGCCTCGCAAGAGAATAGTGAAATGGTTTGAAGAC
AAGCGAGCTGAAGATGGGGTTCCTGATCAACGACTGCCTTATGCTCGGTCTGCTCCTAAATCTGCCTGA
Protein sequenceShow/hide protein sequence
MDKRSNGCSSSYSGTSDDYVEICWCFILQYDVFSGREVLGLHFCHLPSMKGNFEIIKRLDITKLKVMLNVKACARNELQTYKFSGSTGAMDDQVSHISNQVSLQSQNKKQ
VTDDKGHLLIIYRMQIAIANGCDRRFLISVPIFLPGEHGTDSPETPETKHIKVPISLKRRWENTPGEFRRNNRDDVEDQPGFGISQSNQLRLVDEQIRSFVQICDEESQN
YINGENRIYNIVNYSKRSRRILQKSKLKRANPCRVEHRKIRNVSQSLQHQIIKIGGRIPPVRLYWLPSSLDNCSISCSGSRPHIDLPNNPLFFLSSIHNLSLHCHLLRPA
SIPANLSLDERAAKILRCIQASSKFGFAGTERERKGAIAVAVALLSFWLPHYNHRLNLLLEVMEVTFQFGFDVAFNMHASVEYSNFGVVVKICKFTIRPGDSSRKPEIGE
AESFIDPKKNWFARQHMKALSQRLRNYGLRYDDLYDPYYDLDVKEALNRLPREIVDARNQRLKRAMDLSMKHKYLPRISRSYLQDMLALEKAVLMKLKILFCSAYVLQVK
KERAELYSGIKNSFSLRFSGDVEFPIPQVDSASENRKCEVGVSWQDLTAVDGRDGIVLRKEQTLASSSSFFMSVSSFHSSSSMRSMTVATAHKAITQCVADARSDAHEVQ
EKALQNLVFITQLLASMETIYHLNTLVSLGSPETVKLSSSLICSLAMLDKNKAKFGVAGTIQLLVRALSIPSVPAAHHLLTSLAELGQFHGNCTLAVRSGAIPVLINVVE
STSGEDLAGTALAVLGLLARFEEGLRALIKTDRIVNSMVNVLKGRCLLSKEGATEILLRLFDESEGCLRDALRLPEFLGVVADLSVRGSAKAREKAALLMNKIMNSDFDT
YSKADSVYSQCLVRPGFQQRLIFRLHFFNGVPSMCSTKCMIMAYKHSQRLFICRLNHLNNTRCGALRLNAMLYHCAEDSSADQEPLPSEWYEKAFRKIKKLSCSLKNVDL
IDGRLVNVNDDSTIIDERVEQRMRAFKSLVRVFVGSPSSRRRAAEMAASTATNCQPRARFRNPSEREAMVVDSLTKVSNFLNVSAQQRKLVRHTICPQHHIWTGALEQML
KELRMELDPLAYQSPNKGIKMGQQIVSSCLKFLDDASNSNAHFTSWMRPAPLQAVVDSSVSPRWEDILEMFNDLVDSLKDEKCLLHYVTKLEVMKEGLSQIKDVLTDKSI
GYKEARHQESLVQKKLSKTLGHSSRCLFTLLLYYLCGHVRDLEVDFCGGLLKAVENDKFCLFMGRVLSSDEEKIVWNGVRQLDRAMGVFKFVWETAGMKGGLELQGHLWC
VGAEERQLNFKGNPASKQELPHGATVKLRRFGESPKVYHLPSRYRLNTGSVDRVSYGFLGAGEGTGIASSLPRRPPLRLTLSLTFARRRNQNSGVNPSPSSSKKKKRNLS
PKEARDKEDDEEDVDEDALRHCLVYWKKISRMMSEEDLSRLERELGLALGINDDDDQEEEAEEEDLEDNEEAEMPVKLKNWQLRRLASALKKGRRKTSIKSLAAELCLDR
AIVLDLLREPPPNLLMLSASLPDTPTPSVPETKIIQTTDEEPIGDTAEEAKVPVHVMQQRWTAQKRLKKVQVETLERVYRRTKRPTNAMISSIVQVTNLPRKRIVKWFED
KRAEDGVPDQRLPYARSAPKSA