; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030383 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030383
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPB1 domain-containing protein
Genome locationchr8:46801199..46803151
RNA-Seq ExpressionLag0030383
SyntenyLag0030383
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000270 - PB1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051068.1 Phox/Bem1p [Cucumis melo var. makuwa]1.7e-18673.44Show/hide
Query:  MCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSSTSSRIRLF
        MCSYGGHIT RPRTKSLSYLGGETRIISVDPT VNTL++FISHLL ILPIKPPF LKY LP SALDSLISLSSD DL FM  EHLRL  SSS+SSRIRLF
Subjt:  MCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSSTSSRIRLF

Query:  VLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETSSSFGSSSSSASLANVSPPVKPQSED
        + FPEPEK HNVIHHPKTEAWF DALKSAKILQKGRDC VGFDG+GL+GEN+ KG+ DLGNGG SL ESMVLETSSSFGSSSSSASLANVSPP+K QSED
Subjt:  VLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETSSSFGSSSSSASLANVSPPVKPQSED

Query:  FGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYP
        +GLSS+         SD VATLAS+IAPTNSCSSVEN V S+PVISE+NFH+ AAGV  +N  DFSGYA   +PN FQHQ  QFVQ   PVESCLP  Y 
Subjt:  FGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYP

Query:  MTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNL-----------------PLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAYKEVLPEPR
        M SYYP QQ QF+HYQPMPNHMYPVY LPVGQTQVSAPSNL                 P+QWG+H+T T +STH  VLPDASPV+PL QVAYKE++PEP 
Subjt:  MTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNL-----------------PLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAYKEVLPEPR

Query:  SQNFVAMPSLANPIALQSVDEVQQQPVSISNDAAS---GEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ
        SQN  AMPSLANPI   S DEVQQ PV I ND A+    EV RT +ECND+DP RTLIYKSQP PP     LQSK K STNLLSDAMAQLQMI+I Q
Subjt:  SQNFVAMPSLANPIALQSVDEVQQQPVSISNDAAS---GEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ

XP_008441273.1 PREDICTED: uncharacterized protein LOC103485455 [Cucumis melo]6.5e-19172.61Show/hide
Query:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD
        MDPPP  PPSP    +A+   KLRLMCSYGGHIT RPRTKSLSYLGGETRIISVDPT VNTL++FISHLL ILPIKPPF LKY LP SALDSLISLSSD 
Subjt:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS
        DL FM  EHLRL  SSS+SSRIRLF+ FPEPEK HNVIHHPKTEAWF DALKSAKILQKGRDC VGFDG+GL+GEN+ KG+ DLGNGG SL ESMVLETS
Subjt:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS

Query:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN
        SSFGSSSSSASLANVSPP+K QSED+GLSS+         SD VATLAS+IAPTNSCSSVEN V S+PVISE+NFH+ AAGV  +N  DFSGYA   +PN
Subjt:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN

Query:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNL-----------------PLQWGVHNTGTGSSTHV
         FQHQ  QFVQ   PVESCLP  Y M SYYP QQ QF+HYQPMPNHMYPVY LPVGQTQVSAPSNL                 P+QWG+H+T T +STH 
Subjt:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNL-----------------PLQWGVHNTGTGSSTHV

Query:  SVLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAAS---GEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK
         VLPDASPV+PL QVAYKE++PEP SQN  AMPSLANPI   S DEVQQ PV I ND A+    EV RT +ECND+DP RTLIYKSQP PP     LQSK
Subjt:  SVLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAAS---GEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK

Query:  AKASTNLLSDAMAQLQMIQIKQ
         K STNLLSDAMAQLQMI+I Q
Subjt:  AKASTNLLSDAMAQLQMIQIKQ

XP_022133762.1 uncharacterized protein LOC111006259 [Momordica charantia]6.0e-20577.91Show/hide
Query:  PPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQF
        PP PPSPPQSA +    KLRLMCSYGG ITPRPRTKSL YLGGETRIISVDP AVNTL+AFISHLL IL I PPF LKYQLPHSALDSLISLSSDDDLQF
Subjt:  PPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQF

Query:  MLCEHLRLSSSTS-----SRIRLFVLFPEPEK---AHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLE
        MLCEHLRLSSS S     SRIRLFV FPEPEK   A NVIHHPKTEAW VDAL+SAKILQKGRDC VGFDG+GLIGEN+ KGV DLG GGVSLAESMVLE
Subjt:  MLCEHLRLSSSTS-----SRIRLFVLFPEPEK---AHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLE

Query:  TSSSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASR
        TSSSFGSSSSSASLANV P +KP +EDF  SSLDNAV            ASEIA T+SCSS+EN VMSIPVISE+ FHDPAAG+HPQN IDFSGY LA R
Subjt:  TSSSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASR

Query:  PNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQV
        PN FQ Q  QFVQAG PVESCLP+ +PM SYYP QQ QFLHYQPMPNHMYP+Y LPVGQTQVS PSNLP+QWG+ +  TGS +H +++PDASPV  L+QV
Subjt:  PNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQV

Query:  AYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAA---SGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQL
        AYKEV+PEP  QNF A P LANP+AL+  DEV+QQPVSISNDAA   SGEVAR RNECNDDD ART IYKSQPPPPLVPSQLQSKA AST LLSDAMAQL
Subjt:  AYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAA---SGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQL

Query:  QMIQIKQ
        QMI+IKQ
Subjt:  QMIQIKQ

XP_031736617.1 uncharacterized protein LOC101214062 [Cucumis sativus]3.9e-18059.59Show/hide
Query:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD
        MDPPP     PP S  A + TKLRLMCSY GHIT RPRTKSLSYLGGETRIISVDPT VNTL+ FISHLL ILPIKPPF LKY LPHSALDSLISLSS D
Subjt:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS
        DL FM  EHLRL  SSS+SSRIRLF+ FPEPEK HNVIHHPKTEAWF DALKSAKILQKGRDC VGFDG+GLIGEN+ KG+ DLGNGG SL ESMVLETS
Subjt:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS

Query:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN
        SSFGSSSSSASLANVSPP+KPQSEDFGLSS+         SDSVATLAS+I PTNSCSSVEN V S+PVI+E+NFH+ AAGV  +N  DFSGYA   RPN
Subjt:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN

Query:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSA--------------------------------------
         FQHQ  QFVQ   PVESCLP  Y M SYYPVQQ QF+HYQPMPNHMYPVY LPVGQTQVSA                                      
Subjt:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSA--------------------------------------

Query:  --------------------------------------------------------------------------------------------------PS
                                                                                                          PS
Subjt:  --------------------------------------------------------------------------------------------------PS

Query:  NLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDA---ASGEVARTRNECNDDDPART
        NLP+QWG+HNT T  STH  VLPDASPV+PL QVAYKE++PE  SQN  AMPSLANP +L+S DEVQQ PV I ND    AS EV  T +E N+DDP RT
Subjt:  NLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDA---ASGEVARTRNECNDDDPART

Query:  LIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ
        LIYKSQP PP     LQSK +ASTNLLSDAMAQLQMI+I Q
Subjt:  LIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ

XP_038884113.1 uncharacterized protein LOC120075037 [Benincasa hispida]7.8e-22181.58Show/hide
Query:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD
        MDPP      PPQSA  A  TKLRLMCSYGGHIT RPRTK+ SYLGGETRIISVDPT VNTL+AFISHLL ILPIK PF LKY LPHSALDSLISLSSDD
Subjt:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS
        DL FM CEHLRL  SSS+SSRIRLF+  PEPEK HNVIHHPKTEAWFVDALKSAKILQKGRDC VGFDG+GLIGEN+ KGV DLGNGG SL ESMVLETS
Subjt:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS

Query:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN
        SSFGSSSSSASLANVSPP KPQ+EDFGLSSLDNA  LQT SDS+ATLASEIAPTNSCSSVEN VMSIPVISE+NFH+ AAGV PQN  DFSGYALASRPN
Subjt:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN

Query:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAY
        PFQ Q  QFVQA A VESCLPA Y M SYYPVQQ QF+HYQPMPNH+YPVY LPVGQTQ+SAPSNLP+QWG+ +  T SSTH  VLPDASPV+PL  VAY
Subjt:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAY

Query:  KEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAA---SGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQM
        KEV+PEP SQN  AMPSLANPI+L+S DEVQQQPV I NDAA   SGEVA TRNECN+DDPARTLIYKSQP PPLVPS LQSK KASTNLLSDAMAQL M
Subjt:  KEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAA---SGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQM

Query:  IQIKQ
        I+I+Q
Subjt:  IQIKQ

TrEMBL top hitse value%identityAlignment
A0A0A0LMW1 PB1 domain-containing protein4.0e-19472.99Show/hide
Query:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD
        MDPPP     PP S  A + TKLRLMCSY GHIT RPRTKSLSYLGGETRIISVDPT VNTL+ FISHLL ILPIKPPF LKY LPHSALDSLISLSS D
Subjt:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS
        DL FM  EHLRL  SSS+SSRIRLF+ FPEPEK HNVIHHPKTEAWF DALKSAKILQKGRDC VGFDG+GLIGEN+ KG+ DLGNGG SL ESMVLETS
Subjt:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS

Query:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN
        SSFGSSSSSASLANVSPP+KPQSEDFGLSS+         SDSVATLAS+I PTNSCSSVEN V S+PVI+E+NFH+ AAGV  +N  DFSGYA   RPN
Subjt:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN

Query:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSA-----------------PSNLPLQWGVHNTGTGSSTHV
         FQHQ  QFVQ   PVESCLP  Y M SYYPVQQ QF+HYQPMPNHMYPVY LPVGQTQ+SA                 PSNLP+QWG+HNT T  STH 
Subjt:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSA-----------------PSNLPLQWGVHNTGTGSSTHV

Query:  SVLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDA---ASGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK
         VLPDASPV+PL QVAYKE++PE  SQN  AMPSLANP +L+S DEVQQ PV I ND    AS EV  T +E N+DDP RTLIYKSQP PP     LQSK
Subjt:  SVLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDA---ASGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK

Query:  AKASTNLLSDAMAQLQMIQIKQ
         +ASTNLLSDAMAQLQMI+I Q
Subjt:  AKASTNLLSDAMAQLQMIQIKQ

A0A1S3B318 uncharacterized protein LOC1034854553.1e-19172.61Show/hide
Query:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD
        MDPPP  PPSP    +A+   KLRLMCSYGGHIT RPRTKSLSYLGGETRIISVDPT VNTL++FISHLL ILPIKPPF LKY LP SALDSLISLSSD 
Subjt:  MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS
        DL FM  EHLRL  SSS+SSRIRLF+ FPEPEK HNVIHHPKTEAWF DALKSAKILQKGRDC VGFDG+GL+GEN+ KG+ DLGNGG SL ESMVLETS
Subjt:  DLQFMLCEHLRL--SSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETS

Query:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN
        SSFGSSSSSASLANVSPP+K QSED+GLSS+         SD VATLAS+IAPTNSCSSVEN V S+PVISE+NFH+ AAGV  +N  DFSGYA   +PN
Subjt:  SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPN

Query:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNL-----------------PLQWGVHNTGTGSSTHV
         FQHQ  QFVQ   PVESCLP  Y M SYYP QQ QF+HYQPMPNHMYPVY LPVGQTQVSAPSNL                 P+QWG+H+T T +STH 
Subjt:  PFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNL-----------------PLQWGVHNTGTGSSTHV

Query:  SVLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAAS---GEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK
         VLPDASPV+PL QVAYKE++PEP SQN  AMPSLANPI   S DEVQQ PV I ND A+    EV RT +ECND+DP RTLIYKSQP PP     LQSK
Subjt:  SVLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAAS---GEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK

Query:  AKASTNLLSDAMAQLQMIQIKQ
         K STNLLSDAMAQLQMI+I Q
Subjt:  AKASTNLLSDAMAQLQMIQIKQ

A0A5D3BU91 Phox/Bem1p8.0e-18773.44Show/hide
Query:  MCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSSTSSRIRLF
        MCSYGGHIT RPRTKSLSYLGGETRIISVDPT VNTL++FISHLL ILPIKPPF LKY LP SALDSLISLSSD DL FM  EHLRL  SSS+SSRIRLF
Subjt:  MCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSSTSSRIRLF

Query:  VLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETSSSFGSSSSSASLANVSPPVKPQSED
        + FPEPEK HNVIHHPKTEAWF DALKSAKILQKGRDC VGFDG+GL+GEN+ KG+ DLGNGG SL ESMVLETSSSFGSSSSSASLANVSPP+K QSED
Subjt:  VLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETSSSFGSSSSSASLANVSPPVKPQSED

Query:  FGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYP
        +GLSS+         SD VATLAS+IAPTNSCSSVEN V S+PVISE+NFH+ AAGV  +N  DFSGYA   +PN FQHQ  QFVQ   PVESCLP  Y 
Subjt:  FGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYP

Query:  MTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNL-----------------PLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAYKEVLPEPR
        M SYYP QQ QF+HYQPMPNHMYPVY LPVGQTQVSAPSNL                 P+QWG+H+T T +STH  VLPDASPV+PL QVAYKE++PEP 
Subjt:  MTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNL-----------------PLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAYKEVLPEPR

Query:  SQNFVAMPSLANPIALQSVDEVQQQPVSISNDAAS---GEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ
        SQN  AMPSLANPI   S DEVQQ PV I ND A+    EV RT +ECND+DP RTLIYKSQP PP     LQSK K STNLLSDAMAQLQMI+I Q
Subjt:  SQNFVAMPSLANPIALQSVDEVQQQPVSISNDAAS---GEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ

A0A6J1BXN6 uncharacterized protein LOC1110062592.9e-20577.91Show/hide
Query:  PPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQF
        PP PPSPPQSA +    KLRLMCSYGG ITPRPRTKSL YLGGETRIISVDP AVNTL+AFISHLL IL I PPF LKYQLPHSALDSLISLSSDDDLQF
Subjt:  PPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQF

Query:  MLCEHLRLSSSTS-----SRIRLFVLFPEPEK---AHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLE
        MLCEHLRLSSS S     SRIRLFV FPEPEK   A NVIHHPKTEAW VDAL+SAKILQKGRDC VGFDG+GLIGEN+ KGV DLG GGVSLAESMVLE
Subjt:  MLCEHLRLSSSTS-----SRIRLFVLFPEPEK---AHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLE

Query:  TSSSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASR
        TSSSFGSSSSSASLANV P +KP +EDF  SSLDNAV            ASEIA T+SCSS+EN VMSIPVISE+ FHDPAAG+HPQN IDFSGY LA R
Subjt:  TSSSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASR

Query:  PNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQV
        PN FQ Q  QFVQAG PVESCLP+ +PM SYYP QQ QFLHYQPMPNHMYP+Y LPVGQTQVS PSNLP+QWG+ +  TGS +H +++PDASPV  L+QV
Subjt:  PNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQV

Query:  AYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAA---SGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQL
        AYKEV+PEP  QNF A P LANP+AL+  DEV+QQPVSISNDAA   SGEVAR RNECNDDD ART IYKSQPPPPLVPSQLQSKA AST LLSDAMAQL
Subjt:  AYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAA---SGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQL

Query:  QMIQIKQ
        QMI+IKQ
Subjt:  QMIQIKQ

A0A6J1E0T6 uncharacterized protein LOC1114297792.0e-17470.78Show/hide
Query:  PPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLC
        P SPPQ A+ A+PTKLRLMCSYGGH+TPRPRTKSLSYLGGETRIISVDPT VNTL++FISHLL ILPIKPPF LKYQLPHS LDSLISLSSDDDLQ ML 
Subjt:  PPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLC

Query:  EHLRLSSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETSSSFGSSSSS
         HL LSSSTSSRIRLF+LFPEPEK  NVIHHPKTEAWFVDALKSAKI QKGRD  VGFDGD LIGEN+AK VADLGNGGVSLAESM+LETSSSF SSSSS
Subjt:  EHLRLSSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETSSSFGSSSSS

Query:  ASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPNPFQHQGFQF
                       DFGLSSLDN VKL TTSDSVATL+S           ENPV S       NFH   +GV+PQN I FSGYALASRPN FQ Q  + 
Subjt:  ASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPNPFQHQGFQF

Query:  VQAGAPVESC-LPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAYKEVLPEPR
              V+SC LPA YPM SYYPVQQ QF+HYQPMP+H+YPVY+LPVGQT+VSAPSNLP QW +HN  TGS +H           PL QVAYKEV PEPR
Subjt:  VQAGAPVESC-LPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAYKEVLPEPR

Query:  SQNFVAMPSLANPIALQSVDEVQQQPVSISND---AASGEVARTRNEC-----NDDDPARTLIYKSQ--PPPPLVPSQLQSKAKASTNLLSDAMAQLQMI
        +Q F AM       A++S D VQQQPV+ISND   AASGEVA T NEC     N+DDP RTLIYKSQ  PPPPLVPSQLQSKA+A+TN+LSDAM+QLQMI
Subjt:  SQNFVAMPSLANPIALQSVDEVQQQPVSISND---AASGEVARTRNEC-----NDDDPARTLIYKSQ--PPPPLVPSQLQSKAKASTNLLSDAMAQLQMI

Query:  QIK
        Q K
Subjt:  QIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01190.1 Octicosapeptide/Phox/Bem1p family protein1.3e-2430.57Show/hide
Query:  AASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLR-LSSS
        +A  +KLR MCSYGGHI PRP  KSL Y+GG+TRI+ VD    ++L + I+ L   L     F LKYQLP   LDSLIS+++D+DL  M+ E+ R +S+S
Subjt:  AASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLR-LSSS

Query:  TS---SRIRLFVLFPEPEKAHN----VIHHPKTEAWFVDALKSAKILQKGRDCFVGFDG--DGLIGENDAKGV---------ADLGNGGVS---------
         S   SR+RLF+   +PE   +    +    K++ WF++AL SA +L +G   F   D   + L+G +DA  +          D  +G V          
Subjt:  TS---SRIRLFVLFPEPEKAHN----VIHHPKTEAWFVDALKSAKILQKGRDCFVGFDG--DGLIGENDAKGV---------ADLGNGGVS---------

Query:  ---------------LAESMVLETSSSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNF
                       L +S +L+TSSSFGS+SSS SLAN+ PP++   E+ G                V TL     P      +E       V +++  
Subjt:  ---------------LAESMVLETSSSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNF

Query:  HDPAAGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPV---YLLPVGQTQVSAPSNLPLQWG-
        HD              G+A  S P P             PV   LPAA P+T+     + Q   Y       + V   Y  P   T  S P NLP Q   
Subjt:  HDPAAGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPV---YLLPVGQTQVSAPSNLPLQWG-

Query:  --VHNTGTGSS--THVSVLPDAS---PVLPLAQVAYKEVLPEPRSQNFV--AMPSLANPIALQSVDEVQQQPVSISNDAASGEVARTRNECNDDDPARTL
            N+G G    +  SV  D+S   P+       Y+E + +  S + V   M + ++P  L S  +  Q P  I +     + A+++       P +  
Subjt:  --VHNTGTGSS--THVSVLPDAS---PVLPLAQVAYKEVLPEPRSQNFV--AMPSLANPIALQSVDEVQQQPVSISNDAASGEVARTRNECNDDDPARTL

Query:  IYKSQPP
        I+ + PP
Subjt:  IYKSQPP

AT3G18230.1 Octicosapeptide/Phox/Bem1p family protein7.7e-2528.9Show/hide
Query:  PQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLR
        P+   A    KLRLMCS+GGHI PRP  KSL+Y GGETRI+ VD  A  +L++  S L ++L     F LKYQLP   LDSL+++++D+DL+ M+ E+ R
Subjt:  PQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLR

Query:  LSSSTSS----RIRLFVLFPEPEKA---HNVIHHPKTEAWFVDALKSAKILQKG-------RDCFVGFD----GDGLI--------GENDAKGVADLGNG
         +SS ++    R+RLF+   + E A    +++   K++ WFVDAL  + +L +G        +  V  D    G+  I        GEN+ +G  DL   
Subjt:  LSSSTSS----RIRLFVLFPEPEKA---HNVIHHPKTEAWFVDALKSAKILQKG-------RDCFVGFD----GDGLI--------GENDAKGVADLGNG

Query:  GV---------SLAESMVLETS-SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHD
        GV         S+ +S ++E + SS GSSSSS S +N+ PP++ +  +      D  ++ Q    + + + ++    +  S + N  M IP  + N+   
Subjt:  GV---------SLAESMVLETS-SSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHD

Query:  PAAGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLP--VGQTQVSAPSNLPLQWGVHNT
         AA  +     D +  A     +P   +    V  G                          Y+  P  M PV + P  VG   +++P ++       +T
Subjt:  PAAGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLP--VGQTQVSAPSNLPLQWGVHNT

Query:  GTGSSTHVS----------VLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPS---LANPIALQSVDEVQQQPVS
           S+T  S           LP A PV      + +     P+++N  A  +   L+ P    ++D+ QQQP++
Subjt:  GTGSSTHVS----------VLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPS---LANPIALQSVDEVQQQPVS

AT4G05150.1 Octicosapeptide/Phox/Bem1p family protein5.8e-1226.9Show/hide
Query:  PPSPPQSAVAASP-------TKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD
        PP     ++A+SP        ++R MC++GG I PRP    L Y+GG+ R+++V     +T  A +   LA L  K    +KYQLP+  LD+LIS+S+D+
Subjt:  PPSPPQSAVAASP-------TKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRLSSS---TSSRIRLFVLF-------PEPEKAHNVI----HHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGG
        D++ M+ E+ R++ +    +SR+RLF+             +A ++          E WF+DAL                     +G + A     + NGG
Subjt:  DLQFMLCEHLRLSSS---TSSRIRLFVLF-------PEPEKAHNVI----HHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGG

Query:  VSLAESMVLETSSSFGSSSSS--ASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFH-DPAAGVHPQ
               V    SS  S        L N      P      L   D   K+Q     V+TL+   +P     S      S PV+  +     P   + P+
Subjt:  VSLAESMVLETSSSFGSSSSS--ASLANVSPPVKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFH-DPAAGVHPQ

Query:  NGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGS
        +    S      + NP   Q  Q  Q+  PV S           Y     Q +HYQ    H  PVY +P      S P N  +Q G H    G+
Subjt:  NGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGS

AT5G09620.1 Octicosapeptide/Phox/Bem1p family protein1.0e-1344.14Show/hide
Query:  KLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILP---IKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSST
        K++LMCSYGG I PRP    L+Y+ G+T+I+SVD        A +S L A+            KYQLP   LD+LIS+++D+DL+ M+ E+ RL   S+ 
Subjt:  KLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILP---IKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSST

Query:  SSRIRLFVLFP
         +R+RLF LFP
Subjt:  SSRIRLFVLFP

AT5G16220.1 Octicosapeptide/Phox/Bem1p family protein2.3e-4533.53Show/hide
Query:  KLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSTS---S
        KLR+MC YGG I   P+TKS  Y+GG+TRI+++  +A  + A+ +SHL   L I  PF +KYQLP   LDSLIS+ +D+D+Q M+ EH  LSS +S   S
Subjt:  KLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSTS---S

Query:  RIRLFVLFPEPEKAHN-------------------------------------VIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVAD
        RIRLF LFP   +  N                                     V+ HPKTE WFVDALKS +++Q  R         G  G  D      
Subjt:  RIRLFVLFPEPEKAHN-------------------------------------VIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVAD

Query:  LGNGGVSLAESMVLETSSSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTT----SDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPA
         GNGG+   ESM+LET+SSFGS+SSS S +N+ PP+K   ED   +S      +++     + +V  + S   P++S  + EN   S   ++E N   P 
Subjt:  LGNGGVSLAESMVLETSSSFGSSSSSASLANVSPPVKPQSEDFGLSSLDNAVKLQTT----SDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPA

Query:  AGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAP-VESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTG
                +  SGY      N  Q Q  Q +  G P +    P   P T+Y+    +  ++YQ  P   YP+Y +PV Q                     
Subjt:  AGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAP-VESCLPAAYPMTSYYPVQQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTG

Query:  SSTHVSVLPDASPVLPLAQVAYKEV-LPEPRSQNFVA--MPSLANPIALQSVDEVQQQPVSISNDAASGEVARTRNECNDDDPARTLIYKSQPPPPLVPS
        SS HV  L    PV P   + Y +V  P  R+ + +A    S   P++      VQ    +  N  +        +  +D+D A   IYKSQPP P +PS
Subjt:  SSTHVSVLPDASPVLPLAQVAYKEV-LPEPRSQNFVA--MPSLANPIALQSVDEVQQQPVSISNDAASGEVARTRNECNDDDPARTLIYKSQPPPPLVPS

Query:  Q
        Q
Subjt:  Q


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCCACCACCGCCACATCCACCGTCGCCGCCCCAATCCGCCGTTGCCGCCTCCCCCACTAAGCTCCGTTTGATGTGCAGCTACGGCGGCCACATAACCCCACGGCC
GCGCACCAAGTCCCTCTCTTATCTGGGCGGCGAAACCCGCATAATCTCCGTCGACCCAACCGCCGTCAACACTCTCGCCGCCTTCATTTCTCATCTCCTCGCGATCCTCC
CCATTAAACCTCCCTTTTACCTCAAGTATCAGCTCCCACACTCCGCCCTCGACTCTCTCATCTCCCTCTCCTCCGATGACGACCTTCAGTTCATGCTCTGCGAGCACCTC
CGCCTCTCCTCTTCCACTTCCTCTCGCATTAGGCTCTTCGTCTTATTCCCCGAGCCAGAGAAGGCTCACAACGTTATTCATCATCCCAAGACTGAGGCCTGGTTCGTCGA
TGCGCTAAAGAGTGCGAAGATTTTGCAGAAGGGGCGCGATTGCTTCGTGGGGTTTGATGGGGACGGGTTGATTGGAGAGAATGATGCTAAGGGTGTTGCAGATTTGGGTA
ATGGGGGTGTTTCTTTGGCTGAATCCATGGTCTTGGAGACCAGTTCTTCCTTTGGATCATCGTCGTCTTCCGCTTCTTTGGCTAATGTGTCTCCTCCTGTTAAACCTCAG
AGTGAGGACTTTGGACTCAGTTCATTGGATAATGCTGTCAAGCTGCAGACAACTTCCGATTCCGTCGCAACACTCGCGAGCGAGATTGCCCCTACCAACTCTTGTTCTTC
AGTTGAGAATCCGGTTATGTCTATCCCTGTGATATCAGAGAATAATTTTCACGACCCTGCTGCTGGAGTTCATCCCCAGAATGGCATTGACTTTTCAGGCTATGCACTAG
CTTCTCGACCGAACCCTTTTCAGCATCAGGGATTTCAGTTTGTTCAAGCAGGTGCACCGGTAGAGAGCTGCCTTCCTGCCGCGTATCCGATGACTTCTTACTATCCAGTC
CAGCAGTCTCAGTTTCTGCATTACCAGCCGATGCCGAACCATATGTATCCTGTCTACCTTTTGCCTGTTGGGCAGACACAAGTTTCAGCCCCTTCCAATCTACCCTTGCA
GTGGGGTGTGCACAATACTGGAACCGGGAGTTCGACTCATGTTTCGGTCCTTCCTGATGCTTCTCCTGTTCTTCCTCTTGCTCAAGTAGCTTACAAGGAGGTGTTGCCTG
AGCCACGCTCGCAGAATTTTGTAGCAATGCCATCTCTTGCAAATCCGATTGCACTCCAGTCTGTTGATGAAGTTCAGCAGCAGCCTGTAAGCATTTCTAATGATGCTGCA
TCTGGTGAAGTTGCACGTACTCGTAACGAATGCAATGATGATGATCCTGCAAGGACCCTAATATACAAATCTCAACCTCCACCACCCCTGGTTCCTTCTCAATTGCAAAG
TAAAGCTAAAGCCTCGACGAACCTTCTGTCAGACGCGATGGCACAGCTGCAGATGATCCAAATCAAACAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGACCCACCACCGCCACATCCACCGTCGCCGCCCCAATCCGCCGTTGCCGCCTCCCCCACTAAGCTCCGTTTGATGTGCAGCTACGGCGGCCACATAACCCCACGGCC
GCGCACCAAGTCCCTCTCTTATCTGGGCGGCGAAACCCGCATAATCTCCGTCGACCCAACCGCCGTCAACACTCTCGCCGCCTTCATTTCTCATCTCCTCGCGATCCTCC
CCATTAAACCTCCCTTTTACCTCAAGTATCAGCTCCCACACTCCGCCCTCGACTCTCTCATCTCCCTCTCCTCCGATGACGACCTTCAGTTCATGCTCTGCGAGCACCTC
CGCCTCTCCTCTTCCACTTCCTCTCGCATTAGGCTCTTCGTCTTATTCCCCGAGCCAGAGAAGGCTCACAACGTTATTCATCATCCCAAGACTGAGGCCTGGTTCGTCGA
TGCGCTAAAGAGTGCGAAGATTTTGCAGAAGGGGCGCGATTGCTTCGTGGGGTTTGATGGGGACGGGTTGATTGGAGAGAATGATGCTAAGGGTGTTGCAGATTTGGGTA
ATGGGGGTGTTTCTTTGGCTGAATCCATGGTCTTGGAGACCAGTTCTTCCTTTGGATCATCGTCGTCTTCCGCTTCTTTGGCTAATGTGTCTCCTCCTGTTAAACCTCAG
AGTGAGGACTTTGGACTCAGTTCATTGGATAATGCTGTCAAGCTGCAGACAACTTCCGATTCCGTCGCAACACTCGCGAGCGAGATTGCCCCTACCAACTCTTGTTCTTC
AGTTGAGAATCCGGTTATGTCTATCCCTGTGATATCAGAGAATAATTTTCACGACCCTGCTGCTGGAGTTCATCCCCAGAATGGCATTGACTTTTCAGGCTATGCACTAG
CTTCTCGACCGAACCCTTTTCAGCATCAGGGATTTCAGTTTGTTCAAGCAGGTGCACCGGTAGAGAGCTGCCTTCCTGCCGCGTATCCGATGACTTCTTACTATCCAGTC
CAGCAGTCTCAGTTTCTGCATTACCAGCCGATGCCGAACCATATGTATCCTGTCTACCTTTTGCCTGTTGGGCAGACACAAGTTTCAGCCCCTTCCAATCTACCCTTGCA
GTGGGGTGTGCACAATACTGGAACCGGGAGTTCGACTCATGTTTCGGTCCTTCCTGATGCTTCTCCTGTTCTTCCTCTTGCTCAAGTAGCTTACAAGGAGGTGTTGCCTG
AGCCACGCTCGCAGAATTTTGTAGCAATGCCATCTCTTGCAAATCCGATTGCACTCCAGTCTGTTGATGAAGTTCAGCAGCAGCCTGTAAGCATTTCTAATGATGCTGCA
TCTGGTGAAGTTGCACGTACTCGTAACGAATGCAATGATGATGATCCTGCAAGGACCCTAATATACAAATCTCAACCTCCACCACCCCTGGTTCCTTCTCAATTGCAAAG
TAAAGCTAAAGCCTCGACGAACCTTCTGTCAGACGCGATGGCACAGCTGCAGATGATCCAAATCAAACAATAA
Protein sequenceShow/hide protein sequence
MDPPPPHPPSPPQSAVAASPTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTAVNTLAAFISHLLAILPIKPPFYLKYQLPHSALDSLISLSSDDDLQFMLCEHL
RLSSSTSSRIRLFVLFPEPEKAHNVIHHPKTEAWFVDALKSAKILQKGRDCFVGFDGDGLIGENDAKGVADLGNGGVSLAESMVLETSSSFGSSSSSASLANVSPPVKPQ
SEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSSVENPVMSIPVISENNFHDPAAGVHPQNGIDFSGYALASRPNPFQHQGFQFVQAGAPVESCLPAAYPMTSYYPV
QQSQFLHYQPMPNHMYPVYLLPVGQTQVSAPSNLPLQWGVHNTGTGSSTHVSVLPDASPVLPLAQVAYKEVLPEPRSQNFVAMPSLANPIALQSVDEVQQQPVSISNDAA
SGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ