CuGenDBv2

HG10018523 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10018523
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: PB1 domain-containing protein
Genome location: Chr04:4899107..4901126
RNA-Seq Expression: HG10018523
Synteny: HG10018523
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR000270 - PB1 domain
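
The genome location in this record follows a chromosome:start..end pattern. As a minimal sketch, the string can be split and the span length computed as below. The function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which is a common genome-browser convention but is not stated on this page.

```python
import re

def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr04:4899107..4901126")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # Chr04 4899107 4901126 2020
```

Under that assumption the gene model spans 2,020 bp on Chr04.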


Homology
GenBank top hits | e-value | % identity | Alignment
KAA0051068.1 Phox/Bem1p [Cucumis melo var. makuwa] | 2.6e-200 | 78.4
Query:  MCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRL--SSSTSSRIRLF
        MCSYGGHIT R  TK+ SYLGGETRIISVDPTTVNTLS+FISHLLTILPIKPPFSLKYHLPQSALDSLISLSSD DLHFMF EHLRL  SSS+SSRIRLF
Subjt:  MCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRL--SSSTSSRIRLF

Query:  IFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGSSSSSASLANVSSPIKPQSE
        +FFPEPEKPHNVIHHPKTEAWF DALKSAKILQKGRDCL  FDGEGL+GENE KG  DLGNGG  SLPESMVLETSSSFGSSSSSASLANVS PIK QSE
Subjt:  IFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGSSSSSASLANVSSPIKPQSE

Query:  DFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQELQFVQASAPVESCLPAVY
        D+GLSS+        ASD VATLASDIAPTNSCSSVEN V S+PVIS S FH+L AGV  RNPHDFSGYA   +PN FQ QELQFVQ S PVESCLP VY
Subjt:  DFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQELQFVQASAPVESCLPAVY

Query:  QMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV-----------------HWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPKP
        QMPSYYP QQPQF+HYQPMPN MY V++LPVG TQ+S+PSNLPV                  WGLHD AT +STH+LVLPDASPVVPLPQVAYKE+MP+P
Subjt:  QMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV-----------------HWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPKP

Query:  HSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQMIKIKQ
        HSQNLGAMPSLANPI   SAD+VQQ P  VIIPNDVAADA  EV RT +ECN++DP RTLIYKSQP PP     LQSKPK STNLLSDAMAQLQMIKI Q
Subjt:  HSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQMIKIKQ

XP_008441273.1 PREDICTED: uncharacterized protein LOC103485455 [Cucumis melo] | 4.2e-206 | 77.65
Query:  MDQQPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMF
        MD  P  S I +PT+KLRLMCSYGGHIT R  TK+ SYLGGETRIISVDPTTVNTLS+FISHLLTILPIKPPFSLKYHLPQSALDSLISLSSD DLHFMF
Subjt:  MDQQPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMF

Query:  CEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGS
         EHLRL  SSS+SSRIRLF+FFPEPEKPHNVIHHPKTEAWF DALKSAKILQKGRDCL  FDGEGL+GENE KG  DLGNGG  SLPESMVLETSSSFGS
Subjt:  CEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGS

Query:  SSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQ
        SSSSASLANVS PIK QSED+GLSS+        ASD VATLASDIAPTNSCSSVEN V S+PVIS S FH+L AGV  RNPHDFSGYA   +PN FQ Q
Subjt:  SSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQ

Query:  ELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV-----------------HWGLHDAATTSSTHTLVLPD
        ELQFVQ S PVESCLP VYQMPSYYP QQPQF+HYQPMPN MY V++LPVG TQ+S+PSNLPV                  WGLHD AT +STH+LVLPD
Subjt:  ELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV-----------------HWGLHDAATTSSTHTLVLPD

Query:  ASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKA
        ASPVVPLPQVAYKE+MP+PHSQNLGAMPSLANPI   SAD+VQQ P  VIIPNDVAADA  EV RT +ECN++DP RTLIYKSQP PP     LQSKPK 
Subjt:  ASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKA

Query:  STNLLSDAMAQLQMIKIKQ
        STNLLSDAMAQLQMIKI Q
Subjt:  STNLLSDAMAQLQMIKIKQ

XP_022133762.1 uncharacterized protein LOC111006259 [Momordica charantia] | 2.5e-182 | 72.08
Query:  QPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEH
        QP     +  + KLRLMCSYGG ITPR  TK+  YLGGETRIISVDP  VNTLSAFISHLLTIL I PPF+LKY LP SALDSLISLSSDDDL FM CEH
Subjt:  QPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEH

Query:  LRLSSSTS-----SRIRLFIFFPEPEKP---HNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSS
        LRLSSS S     SRIRLF+FFPEPEKP    NVIHHPKTEAW VDAL+SAKILQKGRDC   FDGEGLIGENE KG  DLG GGV SL ESMVLETSSS
Subjt:  LRLSSSTS-----SRIRLFIFFPEPEKP---HNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSS

Query:  FGSSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPF
        FGSSSSSASLANV   IKP +EDF  SSLDN             +AS+IA T+SCSS+EN VMSIPVIS S FHD  AG+HP+N  DFSGY L PRPN F
Subjt:  FGSSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPF

Query:  QQQELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKE
        Q Q LQFVQA  PVESCLP+++ M SYYP QQPQFLHYQPMPN MY ++FLPVG TQ+S+PSNLP+ WGL DAAT S +H LV PDASPV  L QVAYKE
Subjt:  QQQELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKE

Query:  VMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQM
        V+P+PH QN GA P LANP++ E AD+V+Q+PV+  I ND A   SGEVAR RNECN+DD ART IYKSQPPPPLVPS LQSK  AST LLSDAMAQLQM
Subjt:  VMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQM

Query:  IKIKQ
        IKIKQ
Subjt:  IKIKQ

XP_031736617.1 uncharacterized protein LOC101214062 [Cucumis sativus] | 1.2e-189 | 62.91
Query:  MDQQPLQSAITTPT-SKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFM
        MD  P   +   PT +KLRLMCSY GHIT R  TK+ SYLGGETRIISVDPTTVNTLS FISHLLTILPIKPPFSLKYHLP SALDSLISLSS DDLHFM
Subjt:  MDQQPLQSAITTPT-SKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFM

Query:  FCEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFG
        F EHLRL  SSS+SSRIRLF+FFPEPEKPHNVIHHPKTEAWF DALKSAKILQKGRDCL  FDGEGLIGENE KG  DLGNGG  SLPESMVLETSSSFG
Subjt:  FCEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFG

Query:  SSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQ
        SSSSSASLANVS PIKPQSEDFGLSS+        ASDSVATLASDI PTNSCSSVEN V S+PVI+ S FH+L AGV  RNPHDFSGYA   RPN FQ 
Subjt:  SSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQ

Query:  QELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV------------------------------------
        QELQFVQ S PVESCLP VYQMPSYYPVQQPQF+HYQPMPN MY V++LPVG TQ+S+PSNLPV                                    
Subjt:  QELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  HWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLI
         WGLH+ AT  STH+LVLPDASPVVPLPQVAYKE+MP+ HSQN GAMPSLANP S ESAD+VQQ P  VIIPNDVAADAS EV  T +E NEDDP RTLI
Subjt:  HWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLI

Query:  YKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQMIKIKQ
        YKSQP PP     LQSKP+ASTNLLSDAMAQLQMIKI Q
Subjt:  YKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQMIKIKQ

XP_038884113.1 uncharacterized protein LOC120075037 [Benincasa hispida] | 4.7e-234 | 86.65
Query:  MDQQPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMF
        MD  P QSA T PT+KLRLMCSYGGHIT R  TK FSYLGGETRIISVDPTTVNTLSAFISHLLTILPIK PFSLKYHLP SALDSLISLSSDDDLHFMF
Subjt:  MDQQPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMF

Query:  CEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGS
        CEHLRL  SSS+SSRIRLF+F PEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCL  FDGEGLIGENE KG  DLGNGG  SLPESMVLETSSSFGS
Subjt:  CEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGS

Query:  SSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQ
        SSSSASLANVS P KPQ+EDFGLSSLDN A LQTASDS+ATLAS+IAPTNSCSSVEN VMSIPVIS S FH+L AGV P+NPHDFSGYAL  RPNPFQQQ
Subjt:  SSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQ

Query:  ELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMP
        +LQFVQASA VESCLPAVYQMPSYYPVQQPQF+HYQPMPN +Y V+FLPVG TQ+S+PSNLPV WGL DAAT SSTHTLVLPDASPVVPLP VAYKEVMP
Subjt:  ELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMP

Query:  KPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQMIKI
        +PHSQNLGAMPSLANPIS ESAD+VQQ+P  VIIPND AAD SGEVA TRNECNEDDPARTLIYKSQP PPLVPS LQSKPKASTNLLSDAMAQL MIKI
Subjt:  KPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQMIKI

Query:  KQ
        +Q
Subjt:  KQ

TrEMBL top hits | e-value | % identity | Alignment
A0A0A0LMW1 PB1 domain-containing protein | 1.6e-203 | 77.31
Query:  MDQQPLQSAITTPT-SKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFM
        MD  P   +   PT +KLRLMCSY GHIT R  TK+ SYLGGETRIISVDPTTVNTLS FISHLLTILPIKPPFSLKYHLP SALDSLISLSS DDLHFM
Subjt:  MDQQPLQSAITTPT-SKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFM

Query:  FCEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFG
        F EHLRL  SSS+SSRIRLF+FFPEPEKPHNVIHHPKTEAWF DALKSAKILQKGRDCL  FDGEGLIGENE KG  DLGNGG  SLPESMVLETSSSFG
Subjt:  FCEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFG

Query:  SSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQ
        SSSSSASLANVS PIKPQSEDFGLSS+        ASDSVATLASDI PTNSCSSVEN V S+PVI+ S FH+L AGV  RNPHDFSGYA   RPN FQ 
Subjt:  SSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQ

Query:  QELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQI-----------------SSPSNLPVHWGLHDAATTSSTHTLVLP
        QELQFVQ S PVESCLP VYQMPSYYPVQQPQF+HYQPMPN MY V++LPVG TQI                 S+PSNLP+ WGLH+ AT  STH+LVLP
Subjt:  QELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQI-----------------SSPSNLPVHWGLHDAATTSSTHTLVLP

Query:  DASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPK
        DASPVVPLPQVAYKE+MP+ HSQN GAMPSLANP S ESAD+VQQ P  VIIPNDVAADAS EV  T +E NEDDP RTLIYKSQP PP     LQSKP+
Subjt:  DASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPK

Query:  ASTNLLSDAMAQLQMIKIKQ
        ASTNLLSDAMAQLQMIKI Q
Subjt:  ASTNLLSDAMAQLQMIKIKQ

A0A1S3B318 uncharacterized protein LOC103485455 | 2.0e-206 | 77.65
Query:  MDQQPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMF
        MD  P  S I +PT+KLRLMCSYGGHIT R  TK+ SYLGGETRIISVDPTTVNTLS+FISHLLTILPIKPPFSLKYHLPQSALDSLISLSSD DLHFMF
Subjt:  MDQQPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMF

Query:  CEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGS
         EHLRL  SSS+SSRIRLF+FFPEPEKPHNVIHHPKTEAWF DALKSAKILQKGRDCL  FDGEGL+GENE KG  DLGNGG  SLPESMVLETSSSFGS
Subjt:  CEHLRL--SSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGS

Query:  SSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQ
        SSSSASLANVS PIK QSED+GLSS+        ASD VATLASDIAPTNSCSSVEN V S+PVIS S FH+L AGV  RNPHDFSGYA   +PN FQ Q
Subjt:  SSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQ

Query:  ELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV-----------------HWGLHDAATTSSTHTLVLPD
        ELQFVQ S PVESCLP VYQMPSYYP QQPQF+HYQPMPN MY V++LPVG TQ+S+PSNLPV                  WGLHD AT +STH+LVLPD
Subjt:  ELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV-----------------HWGLHDAATTSSTHTLVLPD

Query:  ASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKA
        ASPVVPLPQVAYKE+MP+PHSQNLGAMPSLANPI   SAD+VQQ P  VIIPNDVAADA  EV RT +ECN++DP RTLIYKSQP PP     LQSKPK 
Subjt:  ASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKA

Query:  STNLLSDAMAQLQMIKIKQ
        STNLLSDAMAQLQMIKI Q
Subjt:  STNLLSDAMAQLQMIKIKQ

A0A5D3BU91 Phox/Bem1p | 1.3e-200 | 78.4
Query:  MCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRL--SSSTSSRIRLF
        MCSYGGHIT R  TK+ SYLGGETRIISVDPTTVNTLS+FISHLLTILPIKPPFSLKYHLPQSALDSLISLSSD DLHFMF EHLRL  SSS+SSRIRLF
Subjt:  MCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRL--SSSTSSRIRLF

Query:  IFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGSSSSSASLANVSSPIKPQSE
        +FFPEPEKPHNVIHHPKTEAWF DALKSAKILQKGRDCL  FDGEGL+GENE KG  DLGNGG  SLPESMVLETSSSFGSSSSSASLANVS PIK QSE
Subjt:  IFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGSSSSSASLANVSSPIKPQSE

Query:  DFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQELQFVQASAPVESCLPAVY
        D+GLSS+        ASD VATLASDIAPTNSCSSVEN V S+PVIS S FH+L AGV  RNPHDFSGYA   +PN FQ QELQFVQ S PVESCLP VY
Subjt:  DFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQELQFVQASAPVESCLPAVY

Query:  QMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV-----------------HWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPKP
        QMPSYYP QQPQF+HYQPMPN MY V++LPVG TQ+S+PSNLPV                  WGLHD AT +STH+LVLPDASPVVPLPQVAYKE+MP+P
Subjt:  QMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPV-----------------HWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPKP

Query:  HSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQMIKIKQ
        HSQNLGAMPSLANPI   SAD+VQQ P  VIIPNDVAADA  EV RT +ECN++DP RTLIYKSQP PP     LQSKPK STNLLSDAMAQLQMIKI Q
Subjt:  HSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQMIKIKQ

A0A6J1BXN6 uncharacterized protein LOC111006259 | 1.2e-182 | 72.08
Query:  QPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEH
        QP     +  + KLRLMCSYGG ITPR  TK+  YLGGETRIISVDP  VNTLSAFISHLLTIL I PPF+LKY LP SALDSLISLSSDDDL FM CEH
Subjt:  QPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEH

Query:  LRLSSSTS-----SRIRLFIFFPEPEKP---HNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSS
        LRLSSS S     SRIRLF+FFPEPEKP    NVIHHPKTEAW VDAL+SAKILQKGRDC   FDGEGLIGENE KG  DLG GGV SL ESMVLETSSS
Subjt:  LRLSSSTS-----SRIRLFIFFPEPEKP---HNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSS

Query:  FGSSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPF
        FGSSSSSASLANV   IKP +EDF  SSLDN             +AS+IA T+SCSS+EN VMSIPVIS S FHD  AG+HP+N  DFSGY L PRPN F
Subjt:  FGSSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPF

Query:  QQQELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKE
        Q Q LQFVQA  PVESCLP+++ M SYYP QQPQFLHYQPMPN MY ++FLPVG TQ+S+PSNLP+ WGL DAAT S +H LV PDASPV  L QVAYKE
Subjt:  QQQELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNLPVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKE

Query:  VMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQM
        V+P+PH QN GA P LANP++ E AD+V+Q+PV+  I ND A   SGEVAR RNECN+DD ART IYKSQPPPPLVPS LQSK  AST LLSDAMAQLQM
Subjt:  VMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQM

Query:  IKIKQ
        IKIKQ
Subjt:  IKIKQ

A0A6J1FEI8 uncharacterized protein LOC111444780 | 1.1e-162 | 67.79
Query:  MDQQPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMF
        MD  P QS     TSKLRLMCSYGGHITPR  TKA SYLGGETRIISVD T VNTLSAFISHLLTILPIKPPFSLKY LP SALDSLISLSSDDDLHFM 
Subjt:  MDQQPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMF

Query:  CEHLRLSSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGSSS
        CEHLRL+SSTSSRIRLF+FFPEPEK  NVIHHPKTEAWFVDALK AKILQKG+DCL  FD EG+IGENEAKG  DLGNG V SLPESM+LET+SSFGSSS
Subjt:  CEHLRLSSSTSSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGSSS

Query:  SSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQEL
        SS S ANVS PIK QSEDFGLS  DN AK  T SDS ATL S+IAPTNSC SVEN                           FSG+AL  + NPFQQQ  
Subjt:  SSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQEL

Query:  QFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPN-PMYSVFFLPVGHTQISSPSNLPVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPK
        QFVQA  P+ESCLPA+Y MPSYYPVQQPQF+HYQPMPN  MY V+FLPVG TQ+  PSNLP+HWG HD AT SS                          
Subjt:  QFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPN-PMYSVFFLPVGHTQISSPSNLPVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPK

Query:  PHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQP-PPPLV-PSALQSKPKASTNLLSDAMAQLQMIK
          + N G     ANPI+ E+AD+ Q++  AV I N  AADAS  V    +ECNEDDPARTLIYKSQP PPPLV PS  Q+K K STNLLSDAMAQLQ+IK
Subjt:  PHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPARTLIYKSQP-PPPLV-PSALQSKPKASTNLLSDAMAQLQMIK

Query:  IKQ
        IKQ
Subjt:  IKQ

SwissProt top hits | e-value | % identity | Alignment
No hits found

Arabidopsis top hits | e-value | % identity | Alignment
AT1G25300.1 Octicosapeptide/Phox/Bem1p family protein | 5.2e-13 | 45.19
Query:  KLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRLSSSTSSRIR
        KL L+CSYGG I P  P K+  Y+GGETR++ V P  ++ L  F   L   L     FSLKY LP    DSLI++S ++DL  M  E+    S+   RIR
Subjt:  KLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRLSSSTSSRIR

Query:  LFIF
        LF+F
Subjt:  LFIF

AT2G01190.1 Octicosapeptide/Phox/Bem1p family protein | 6.6e-24 | 31.51
Query:  TSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLR-LSSSTS-
        +SKLR MCSYGGHI PR   K+  Y+GG+TRI+ VD    ++L + I+ L   L     F+LKY LP   LDSLIS+++D+DL  M  E+ R +S+S S 
Subjt:  TSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLR-LSSSTS-

Query:  --SRIRLFIFFPEPEKPHN----VIHHPKTEAWFVDALKSAKILQKG--------RDCLGVFDGEGL---IGENEAKGFAD-------------------
          SR+RLF+F  +PE   +    +    K++ WF++AL SA +L +G           LG+ D   L    G+N  +   D                   
Subjt:  --SRIRLFIFFPEPEKPHN----VIHHPKTEAWFVDALKSAKILQKG--------RDCLGVFDGEGL---IGENEAKGFAD-------------------

Query:  -LGNGGVSSLPESMVLETSSSFGSSSSSASLANVSSPIKPQSED-FGLSSLDNPAKL---------------QTASDSVATLASDIAPTNSCSSVENVVM
          G   V+ LP+S +L+TSSSFGS+SSS SLAN+  PI+   E+  G+ +L +   L               Q   D  A ++S          V   + 
Subjt:  -LGNGGVSSLPESMVLETSSSFGSSSSSASLANVSSPIKPQSED-FGLSSLDNPAKL---------------QTASDSVATLASDIAPTNSCSSVENVVM

Query:  SIPVISGSKFHDLPAGVH---PRNPHDF-SGYALPPRPNPFQQQELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMY----SVFFLPVGH
        + PV + +  ++  A V+    R+ H   +GY  PP P   Q Q L   QA   ++S     +++PS   V          M NPM+    SV+  P+  
Subjt:  SIPVISGSKFHDLPAGVH---PRNPHDF-SGYALPPRPNPFQQQELQFVQASAPVESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMY----SVFFLPVGH

Query:  TQISSPSNLPVHWGL---HDAATTSSTHTLVLPDASPVVPL-PQVAYKEVMPKPHSQ
             PS   V  G+    D +T  S H     +  P   L PQ   +    +P  Q
Subjt:  TQISSPSNLPVHWGL---HDAATTSSTHTLVLPDASPVVPL-PQVAYKEVMPKPHSQ

AT3G18230.1 Octicosapeptide/Phox/Bem1p family protein | 5.2e-21 | 29.18
Query:  PTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRLSSSTSS
        P +KLRLMCS+GGHI PR   K+ +Y GGETRI+ VD     +LS+  S L ++L     F+LKY LP   LDSL+++++D+DL  M  E+ R +SS ++
Subjt:  PTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRLSSSTSS

Query:  ----RIRLFIFFPEPEKP---HNVIHHPKTEAWFVDALKSAKILQKG-RDCLGVFD----------GEGLI--------GENEAKGFADLGNGGV-----
            R+RLF+F  + E      +++   K++ WFVDAL  + +L +G  D   V +          GE  I        GEN  +G  DL   GV     
Subjt:  ----RIRLFIFFPEPEKP---HNVIHHPKTEAWFVDALKSAKILQKG-RDCLGVFD----------GEGLI--------GENEAKGFADLGNGGV-----

Query:  ---SSLPESMVLETS-SSFGSSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHP
           SS+P+S ++E + SS GSSSSS S +N+  PI+ +  +      D   + Q A  + + + +     +  S + N  M IP                
Subjt:  ---SSLPESMVLETS-SSFGSSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHP

Query:  RNPHDFSGYALPPRPNPFQQQELQFVQASAPVESCLPAV--YQMPSYYPVQQPQFLHYQPMPNPMYSVFFLP--VGHTQISSPSNLPVHWGLHDAATTSS
                        P    +      +AP +   PA   +  P     +      Y+  P PM  V   P  VG   ++SP ++     +  A  TS 
Subjt:  RNPHDFSGYALPPRPNPFQQQELQFVQASAPVESCLPAV--YQMPSYYPVQQPQFLHYQPMPNPMYSVFFLP--VGHTQISSPSNLPVHWGLHDAATTSS

Query:  THTLVLPDASPVVP-LPQVAYKEVMPK------PHSQNLGAMPS---LANPISPESADDVQQKPVA
        +  +   D  P +P  P VA  E          P ++N  A  +   L+ P +  + D  QQ+P+A
Subjt:  THTLVLPDASPVVP-LPQVAYKEVMPK------PHSQNLGAMPS---LANPISPESADDVQQKPVA

AT5G09620.1 Octicosapeptide/Phox/Bem1p family protein | 2.6e-12 | 41.74
Query:  KLRLMCSYGGHITPRLPTKAFSYLGGETRIISVD-----PTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRL--SS
        K++LMCSYGG I PR      +Y+ G+T+I+SVD     P  V+ LSA  S            S KY LP   LD+LIS+++D+DL  M  E+ RL   S
Subjt:  KLRLMCSYGGHITPRLPTKAFSYLGGETRIISVD-----PTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRL--SS

Query:  STSSRIRLFIFFPEP
        +  +R+RLF+F   P
Subjt:  STSSRIRLFIFFPEP

AT5G16220.1 Octicosapeptide/Phox/Bem1p family protein | 2.6e-36 | 31.32
Query:  KLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRLSSSTS---S
        KLR+MC YGG I     TK+  Y+GG+TRI+++  +   + ++ +SHL   L I  PF +KY LP   LDSLIS+ +D+D+  M  EH  LSS +S   S
Subjt:  KLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRLSSSTS---S

Query:  RIRLFIF------------------------------FPEPEKP------HNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADL
        RIRLF+F                                E  KP        V+ HPKTE WFVDALKS +++Q  R   G              G  D 
Subjt:  RIRLFIF------------------------------FPEPEKP------HNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADL

Query:  GNGGVSSLPESMVLETSSSFGSSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVH
        GNGG+    ESM+LET+SSFGS+SSS S +N+  PIK   E                 D++A      AP  S +S +N  ++ P+ S    H+LP+  H
Subjt:  GNGGVSSLPESMVLETSSSFGSSSSSASLANVSSPIKPQSEDFGLSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVH

Query:  -----PRN-----------PHDFSGYALPPRPNPFQQQELQFVQASAP-VESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNL
             P +           P   SGY  PP  N  QQQ +Q +    P +    P      +Y+       ++YQ  P P Y ++++PV          L
Subjt:  -----PRN-----------PHDFSGYALPPRPNPFQQQELQFVQASAP-VESCLPAVYQMPSYYPVQQPQFLHYQPMPNPMYSVFFLPVGHTQISSPSNL

Query:  PVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPART
        PV       +T  + H +  P      PL      +V P     +     S    ++  S D       A I   DV  D              +D A  
Subjt:  PVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADASGEVARTRNECNEDDPART

Query:  LIYKSQPPPPLVPS
         IYKSQPP P +PS
Subjt:  LIYKSQPPPPLVPS


Sequences
CDS sequence
ATGGACCAACAACCGCTACAATCCGCCATCACCACCCCCACCTCAAAGCTCCGTTTGATGTGCAGCTACGGCGGCCACATAACCCCACGCCTCCCCACCAAGGCCTTCTC
CTATTTAGGCGGCGAAACACGCATAATCTCTGTCGACCCAACCACCGTCAACACTCTCTCCGCCTTCATTTCTCACCTCCTAACAATTCTCCCCATTAAACCCCCATTTT
CCCTCAAGTATCACCTCCCTCAGTCCGCCCTCGATTCTCTCATCTCCCTCTCCTCCGATGACGACCTTCATTTCATGTTCTGCGAGCATCTCCGTCTCTCCTCTTCTACT
TCCTCTCGCATTCGGCTCTTCATCTTTTTCCCCGAACCCGAAAAGCCTCATAATGTTATTCATCATCCCAAGACGGAGGCCTGGTTCGTCGATGCGCTTAAGAGTGCGAA
GATTTTGCAGAAGGGGCGCGATTGCTTGGGGGTTTTTGATGGGGAAGGGTTGATTGGAGAGAATGAAGCTAAGGGTTTTGCAGATTTGGGTAATGGGGGTGTTTCTTCTT
TGCCGGAGTCCATGGTTTTGGAGACCAGTTCTTCTTTTGGATCATCGTCTTCTTCGGCCTCTTTGGCTAATGTGTCTTCTCCCATTAAACCTCAGAGTGAGGATTTTGGA
CTCAGTTCGCTGGATAATCCTGCTAAGCTGCAGACAGCTTCGGATTCCGTCGCAACCCTCGCGAGCGATATTGCCCCTACCAACTCATGTTCTTCAGTTGAGAATGTGGT
TATGTCTATTCCTGTGATATCAGGGAGTAAATTTCACGACCTCCCTGCTGGAGTTCATCCCCGGAACCCCCATGATTTTTCAGGCTATGCACTACCTCCCCGGCCGAACC
CTTTTCAGCAGCAGGAATTGCAGTTTGTTCAAGCAAGCGCGCCGGTAGAGAGCTGCCTTCCTGCTGTGTATCAAATGCCTTCTTACTATCCAGTCCAGCAGCCTCAGTTT
CTGCATTACCAGCCGATGCCGAACCCTATGTATTCTGTGTTCTTTTTGCCTGTTGGACATACACAGATTTCATCCCCTTCCAATCTACCCGTGCACTGGGGCTTGCACGA
TGCTGCGACCACGAGTTCAACTCATACGTTGGTTCTGCCTGATGCTTCACCTGTTGTTCCTCTTCCTCAGGTAGCTTACAAGGAGGTGATGCCTAAGCCACATTCACAGA
ATCTTGGAGCAATGCCATCTCTTGCTAATCCGATTTCTCCCGAGTCTGCTGACGATGTTCAGCAGAAGCCGGTAGCAGTAATAATTCCTAATGATGTTGCAGCTGATGCA
TCTGGTGAAGTTGCACGTACGCGTAACGAATGCAACGAGGATGATCCTGCAAGGACCCTAATATACAAATCTCAGCCTCCGCCACCACTAGTTCCTTCTGCATTGCAAAG
TAAACCTAAAGCCTCGACGAACCTTCTGTCGGATGCGATGGCACAGCTGCAGATGATAAAAATCAAGCAATAA
mRNA sequence
ATGGACCAACAACCGCTACAATCCGCCATCACCACCCCCACCTCAAAGCTCCGTTTGATGTGCAGCTACGGCGGCCACATAACCCCACGCCTCCCCACCAAGGCCTTCTC
CTATTTAGGCGGCGAAACACGCATAATCTCTGTCGACCCAACCACCGTCAACACTCTCTCCGCCTTCATTTCTCACCTCCTAACAATTCTCCCCATTAAACCCCCATTTT
CCCTCAAGTATCACCTCCCTCAGTCCGCCCTCGATTCTCTCATCTCCCTCTCCTCCGATGACGACCTTCATTTCATGTTCTGCGAGCATCTCCGTCTCTCCTCTTCTACT
TCCTCTCGCATTCGGCTCTTCATCTTTTTCCCCGAACCCGAAAAGCCTCATAATGTTATTCATCATCCCAAGACGGAGGCCTGGTTCGTCGATGCGCTTAAGAGTGCGAA
GATTTTGCAGAAGGGGCGCGATTGCTTGGGGGTTTTTGATGGGGAAGGGTTGATTGGAGAGAATGAAGCTAAGGGTTTTGCAGATTTGGGTAATGGGGGTGTTTCTTCTT
TGCCGGAGTCCATGGTTTTGGAGACCAGTTCTTCTTTTGGATCATCGTCTTCTTCGGCCTCTTTGGCTAATGTGTCTTCTCCCATTAAACCTCAGAGTGAGGATTTTGGA
CTCAGTTCGCTGGATAATCCTGCTAAGCTGCAGACAGCTTCGGATTCCGTCGCAACCCTCGCGAGCGATATTGCCCCTACCAACTCATGTTCTTCAGTTGAGAATGTGGT
TATGTCTATTCCTGTGATATCAGGGAGTAAATTTCACGACCTCCCTGCTGGAGTTCATCCCCGGAACCCCCATGATTTTTCAGGCTATGCACTACCTCCCCGGCCGAACC
CTTTTCAGCAGCAGGAATTGCAGTTTGTTCAAGCAAGCGCGCCGGTAGAGAGCTGCCTTCCTGCTGTGTATCAAATGCCTTCTTACTATCCAGTCCAGCAGCCTCAGTTT
CTGCATTACCAGCCGATGCCGAACCCTATGTATTCTGTGTTCTTTTTGCCTGTTGGACATACACAGATTTCATCCCCTTCCAATCTACCCGTGCACTGGGGCTTGCACGA
TGCTGCGACCACGAGTTCAACTCATACGTTGGTTCTGCCTGATGCTTCACCTGTTGTTCCTCTTCCTCAGGTAGCTTACAAGGAGGTGATGCCTAAGCCACATTCACAGA
ATCTTGGAGCAATGCCATCTCTTGCTAATCCGATTTCTCCCGAGTCTGCTGACGATGTTCAGCAGAAGCCGGTAGCAGTAATAATTCCTAATGATGTTGCAGCTGATGCA
TCTGGTGAAGTTGCACGTACGCGTAACGAATGCAACGAGGATGATCCTGCAAGGACCCTAATATACAAATCTCAGCCTCCGCCACCACTAGTTCCTTCTGCATTGCAAAG
TAAACCTAAAGCCTCGACGAACCTTCTGTCGGATGCGATGGCACAGCTGCAGATGATAAAAATCAAGCAATAA
Protein sequence
MDQQPLQSAITTPTSKLRLMCSYGGHITPRLPTKAFSYLGGETRIISVDPTTVNTLSAFISHLLTILPIKPPFSLKYHLPQSALDSLISLSSDDDLHFMFCEHLRLSSST
SSRIRLFIFFPEPEKPHNVIHHPKTEAWFVDALKSAKILQKGRDCLGVFDGEGLIGENEAKGFADLGNGGVSSLPESMVLETSSSFGSSSSSASLANVSSPIKPQSEDFG
LSSLDNPAKLQTASDSVATLASDIAPTNSCSSVENVVMSIPVISGSKFHDLPAGVHPRNPHDFSGYALPPRPNPFQQQELQFVQASAPVESCLPAVYQMPSYYPVQQPQF
LHYQPMPNPMYSVFFLPVGHTQISSPSNLPVHWGLHDAATTSSTHTLVLPDASPVVPLPQVAYKEVMPKPHSQNLGAMPSLANPISPESADDVQQKPVAVIIPNDVAADA
SGEVARTRNECNEDDPARTLIYKSQPPPPLVPSALQSKPKASTNLLSDAMAQLQMIKIKQ