CuGenDBv2

Tan0020936 (gene) of Snake gourd v1 genome

Gene ID: Tan0020936
Organism: Trichosanthes anguina (Snake gourd v1)
Description: PB1 domain-containing protein
Genome location: LG11:1181865..1184212
RNA-Seq Expression: Tan0020936
Synteny: Tan0020936
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR000270 - PB1 domain
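
The genome location above follows a sequence:start..end pattern. A minimal Python sketch for splitting such a string is shown here. It assumes 1-based inclusive coordinates, which this page does not state, and the helper name parse_location is hypothetical rather than part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its parts (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG11:1181865..1184212")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2348 bases on LG11. That is longer than the mRNA sequence listed further down, so the span may include intronic sequence.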


Homology
GenBank top hits (hit; e value; % identity; alignment)
KAA0051068.1 Phox/Bem1p [Cucumis melo var. makuwa] (e value: 7.2e-190; % identity: 74.85)
Query:  MCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSSSSSRIRLF
        MCSYGGHIT RPRTKSLSYLGGETRIISVDPTTVNTLS+FISHLLTILP+KPPFSLKY LP SALDSLISLSSD DL FM  EHLRL  SSSSSSRIRLF
Subjt:  MCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSSSSSRIRLF

Query:  VFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETSSSFGSSSSSTSLANVPTAIKPQSED
        +FFPEPEK HNVIHHPKTEAWF DALKSAKIL KGRDCLVGFDGEGL GENE KG+ DL NGG SLPESMVLETSSSFGSSSSS SLANV   IK QSED
Subjt:  VFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETSSSFGSSSSSTSLANVPTAIKPQSED

Query:  FGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAPVESCLPAVYP
        +GLSS+         SD VATLAS+IAPTNSCS VEN V S+P+ISESNFH+ AAGV  +NP DFSGYA   QPN FQ Q LQFVQ   PVESCLP VY 
Subjt:  FGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAPVESCLPAVYP

Query:  MASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPV-----------------QWGLHDAVTGSLSHASVLPDASPVLPLPQVAYKEVTPEPR
        M SYYP QQPQFVHYQ MPNHMYPVY+LPVGQTQVSAPSNLPV                 QWGLHD  T + +H+ VLPDASPV+PLPQVAYKE+ PEP 
Subjt:  MASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPV-----------------QWGLHDAVTGSLSHASVLPDASPVLPLPQVAYKEVTPEPR

Query:  TQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ
        +QN GAM  LANPI   S DEVQQ PV I ND AA    EV RT +ECND+DP RTLIYKSQP PP     LQSK K STNLLSDAMAQLQMI+I Q
Subjt:  TQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ

XP_008441273.1 PREDICTED: uncharacterized protein LOC103485455 [Cucumis melo] (e value: 6.3e-194; % identity: 74.14)
Query:  MDPPPPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDD
        MDPPP  PPS   S TA    KLRLMCSYGGHIT RPRTKSLSYLGGETRIISVDPTTVNTLS+FISHLLTILP+KPPFSLKY LP SALDSLISLSSD 
Subjt:  MDPPPPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRL--SSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETS
        DL FM  EHLRL  SSSSSSRIRLF+FFPEPEK HNVIHHPKTEAWF DALKSAKIL KGRDCLVGFDGEGL GENE KG+ DL NGG SLPESMVLETS
Subjt:  DLQFMLCEHLRL--SSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETS

Query:  SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPN
        SSFGSSSSS SLANV   IK QSED+GLSS+         SD VATLAS+IAPTNSCS VEN V S+P+ISESNFH+ AAGV  +NP DFSGYA   QPN
Subjt:  SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPN

Query:  SFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPV-----------------QWGLHDAVTGSLSHA
         FQ Q LQFVQ   PVESCLP VY M SYYP QQPQFVHYQ MPNHMYPVY+LPVGQTQVSAPSNLPV                 QWGLHD  T + +H+
Subjt:  SFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPV-----------------QWGLHDAVTGSLSHA

Query:  SVLPDASPVLPLPQVAYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK
         VLPDASPV+PLPQVAYKE+ PEP +QN GAM  LANPI   S DEVQQ PV I ND AA    EV RT +ECND+DP RTLIYKSQP PP     LQSK
Subjt:  SVLPDASPVLPLPQVAYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK

Query:  AKASTNLLSDAMAQLQMIQIKQ
         K STNLLSDAMAQLQMI+I Q
Subjt:  AKASTNLLSDAMAQLQMIQIKQ

XP_022133762.1 uncharacterized protein LOC111006259 [Momordica charantia] (e value: 3.1e-209; % identity: 79.09)
Query:  PPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQF
        PP+PPS PQS   AA  KLRLMCSYGG ITPRPRTKSL YLGGETRIISVDP  VNTLSAFISHLLTIL + PPF+LKYQLPHSALDSLISLSSDDDLQF
Subjt:  PPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQF

Query:  MLCEHLRLSSSSS-----SRIRLFVFFPEPEK---THNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLE
        MLCEHLRLSSS S     SRIRLFVFFPEPEK     NVIHHPKTEAW VDAL+SAKIL KGRDC VGFDGEGL GENE KGV DL  GGVSL ESMVLE
Subjt:  MLCEHLRLSSSSS-----SRIRLFVFFPEPEK---THNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLE

Query:  TSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQ
        TSSSFGSSSSS SLANVP  IKP +EDF  SSLDNAV            ASEIA T+SCS +EN VMSIP+ISES FHDPAAG+HPQN IDFSGY LA +
Subjt:  TSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQ

Query:  PNSFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQV
        PNSFQ QALQFVQAG PVESCLP+++PMASYYP QQPQF+HYQ MPNHMYP+YFLPVGQTQVS PSNLP+QWGL DA TGSLSHA ++PDASPV  L QV
Subjt:  PNSFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQV

Query:  AYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQL
        AYKEV PEP  QNFGA  PLANP+A++  DEV+QQPVSISND A A+SGEVAR RNECNDDD ART IYKSQPPPPLVPSQLQSKA AST LLSDAMAQL
Subjt:  AYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQL

Query:  QMIQIKQ
        QMI+IKQ
Subjt:  QMIQIKQ

XP_023536931.1 uncharacterized protein LOC111798160 [Cucurbita pepo subsp. pepo] (e value: 2.4e-185; % identity: 74.11)
Query:  PPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLC
        P S PQ A  AA TKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLS+FISHLLTILP+KPPFSLKYQLP+SALDSLISLSSDDDLQ MLC
Subjt:  PPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLC

Query:  EHLRLSSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETSSSFGSSSSS
         HL LSSS+SSRIRLF+ FPEPEKT NVIHHPKTEAWFVDALKSAKI  KGRD LVGFDG+ L GENEAK VADL NGGVSL ESM+LETSSSF SSSSS
Subjt:  EHLRLSSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETSSSFGSSSSS

Query:  TSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPNSFQQQALQF
                       DFGLSSLDN VKL TTSDSVATL+S           ENPV       +SNFH   +GV+PQNPI FSGYALAS+PN FQQQALQF
Subjt:  TSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPNSFQQQALQF

Query:  VQAGAPVESC-LPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQVAYKEVTPEPR
        V+A   V+SC LPAVYPM SYYPVQQPQFVHYQ MP+H+YP+YFLPVGQTQVSAPSNLP QW LH+A TGSLSH+           LPQVAYKEVTPEPR
Subjt:  VQAGAPVESC-LPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQVAYKEVTPEPR

Query:  TQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECND--------DDPARTLIYKSQ--PPPPLVPSQLQSKAKASTNLLSDAMAQL
        TQ FGAM       AIKS DEVQQQPV+ISND A A S EVA T NECND        DDP RTLIYKSQ  PPPPLVPSQLQSKA+A+TN+LSDAM+QL
Subjt:  TQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECND--------DDPARTLIYKSQ--PPPPLVPSQLQSKAKASTNLLSDAMAQL

Query:  QMIQIK
        QMIQ K
Subjt:  QMIQIK

XP_038884113.1 uncharacterized protein LOC120075037 [Benincasa hispida] (e value: 6.9e-225; % identity: 82.77)
Query:  MDPPPPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDD
        MDPPP      PQSAT A  TKLRLMCSYGGHIT RPRTK+ SYLGGETRIISVDPTTVNTLSAFISHLLTILP+K PFSLKY LPHSALDSLISLSSDD
Subjt:  MDPPPPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRL--SSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETS
        DL FM CEHLRL  SSSSSSRIRLF+F PEPEK HNVIHHPKTEAWFVDALKSAKIL KGRDCLVGFDGEGL GENE KGV DL NGG SLPESMVLETS
Subjt:  DLQFMLCEHLRL--SSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETS

Query:  SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPN
        SSFGSSSSS SLANV    KPQ+EDFGLSSLDNA  LQT SDS+ATLASEIAPTNSCS VEN VMSIP+ISESNFH+ AAGV PQNP DFSGYALAS+PN
Subjt:  SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPN

Query:  SFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQVAY
         FQQQ LQFVQA A VESCLPAVY M SYYPVQQPQFVHYQ MPNH+YPVYFLPVGQTQ+SAPSNLPVQWGL DA T S +H  VLPDASPV+PLP VAY
Subjt:  SFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQVAY

Query:  KEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQM
        KEV PEP +QN GAM  LANPI+++S DEVQQQPV I ND AA  SGEVA TRNECN+DDPARTLIYKSQP PPLVPS LQSK KASTNLLSDAMAQL M
Subjt:  KEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQM

Query:  IQIKQ
        I+I+Q
Subjt:  IQIKQ

TrEMBL top hits (hit; e value; % identity; alignment)
A0A0A0LMW1 PB1 domain-containing protein (e value: 1.2e-195; % identity: 72.99)
Query:  MDPPPPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDD
        MDPPPP     P S  A  +TKLRLMCSY GHIT RPRTKSLSYLGGETRIISVDPTTVNTLS FISHLLTILP+KPPFSLKY LPHSALDSLISLSS D
Subjt:  MDPPPPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRL--SSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETS
        DL FM  EHLRL  SSSSSSRIRLF+FFPEPEK HNVIHHPKTEAWF DALKSAKIL KGRDCLVGFDGEGL GENE KG+ DL NGG SLPESMVLETS
Subjt:  DLQFMLCEHLRL--SSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETS

Query:  SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPN
        SSFGSSSSS SLANV   IKPQSEDFGLSS+         SDSVATLAS+I PTNSCS VEN V S+P+I+ESNFH+ AAGV  +NP DFSGYA   +PN
Subjt:  SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPN

Query:  SFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSA-----------------PSNLPVQWGLHDAVTGSLSHA
         FQ Q LQFVQ   PVESCLP VY M SYYPVQQPQFVHYQ MPNHMYPVY+LPVGQTQ+SA                 PSNLP+QWGLH+  T   +H+
Subjt:  SFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSA-----------------PSNLPVQWGLHDAVTGSLSHA

Query:  SVLPDASPVLPLPQVAYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK
         VLPDASPV+PLPQVAYKE+ PE  +QN GAM  LANP +++S DEVQQ PV I ND AA  S EV  T +E N+DDP RTLIYKSQP PP     LQSK
Subjt:  SVLPDASPVLPLPQVAYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK

Query:  AKASTNLLSDAMAQLQMIQIKQ
         +ASTNLLSDAMAQLQMI+I Q
Subjt:  AKASTNLLSDAMAQLQMIQIKQ

A0A1S3B318 uncharacterized protein LOC103485455 (e value: 3.0e-194; % identity: 74.14)
Query:  MDPPPPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDD
        MDPPP  PPS   S TA    KLRLMCSYGGHIT RPRTKSLSYLGGETRIISVDPTTVNTLS+FISHLLTILP+KPPFSLKY LP SALDSLISLSSD 
Subjt:  MDPPPPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDD

Query:  DLQFMLCEHLRL--SSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETS
        DL FM  EHLRL  SSSSSSRIRLF+FFPEPEK HNVIHHPKTEAWF DALKSAKIL KGRDCLVGFDGEGL GENE KG+ DL NGG SLPESMVLETS
Subjt:  DLQFMLCEHLRL--SSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETS

Query:  SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPN
        SSFGSSSSS SLANV   IK QSED+GLSS+         SD VATLAS+IAPTNSCS VEN V S+P+ISESNFH+ AAGV  +NP DFSGYA   QPN
Subjt:  SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPN

Query:  SFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPV-----------------QWGLHDAVTGSLSHA
         FQ Q LQFVQ   PVESCLP VY M SYYP QQPQFVHYQ MPNHMYPVY+LPVGQTQVSAPSNLPV                 QWGLHD  T + +H+
Subjt:  SFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPV-----------------QWGLHDAVTGSLSHA

Query:  SVLPDASPVLPLPQVAYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK
         VLPDASPV+PLPQVAYKE+ PEP +QN GAM  LANPI   S DEVQQ PV I ND AA    EV RT +ECND+DP RTLIYKSQP PP     LQSK
Subjt:  SVLPDASPVLPLPQVAYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSK

Query:  AKASTNLLSDAMAQLQMIQIKQ
         K STNLLSDAMAQLQMI+I Q
Subjt:  AKASTNLLSDAMAQLQMIQIKQ

A0A5D3BU91 Phox/Bem1p (e value: 3.5e-190; % identity: 74.85)
Query:  MCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSSSSSRIRLF
        MCSYGGHIT RPRTKSLSYLGGETRIISVDPTTVNTLS+FISHLLTILP+KPPFSLKY LP SALDSLISLSSD DL FM  EHLRL  SSSSSSRIRLF
Subjt:  MCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SSSSSSRIRLF

Query:  VFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETSSSFGSSSSSTSLANVPTAIKPQSED
        +FFPEPEK HNVIHHPKTEAWF DALKSAKIL KGRDCLVGFDGEGL GENE KG+ DL NGG SLPESMVLETSSSFGSSSSS SLANV   IK QSED
Subjt:  VFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETSSSFGSSSSSTSLANVPTAIKPQSED

Query:  FGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAPVESCLPAVYP
        +GLSS+         SD VATLAS+IAPTNSCS VEN V S+P+ISESNFH+ AAGV  +NP DFSGYA   QPN FQ Q LQFVQ   PVESCLP VY 
Subjt:  FGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAPVESCLPAVYP

Query:  MASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPV-----------------QWGLHDAVTGSLSHASVLPDASPVLPLPQVAYKEVTPEPR
        M SYYP QQPQFVHYQ MPNHMYPVY+LPVGQTQVSAPSNLPV                 QWGLHD  T + +H+ VLPDASPV+PLPQVAYKE+ PEP 
Subjt:  MASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPV-----------------QWGLHDAVTGSLSHASVLPDASPVLPLPQVAYKEVTPEPR

Query:  TQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ
        +QN GAM  LANPI   S DEVQQ PV I ND AA    EV RT +ECND+DP RTLIYKSQP PP     LQSK K STNLLSDAMAQLQMI+I Q
Subjt:  TQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ

A0A6J1BXN6 uncharacterized protein LOC111006259 (e value: 1.5e-209; % identity: 79.09)
Query:  PPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQF
        PP+PPS PQS   AA  KLRLMCSYGG ITPRPRTKSL YLGGETRIISVDP  VNTLSAFISHLLTIL + PPF+LKYQLPHSALDSLISLSSDDDLQF
Subjt:  PPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQF

Query:  MLCEHLRLSSSSS-----SRIRLFVFFPEPEK---THNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLE
        MLCEHLRLSSS S     SRIRLFVFFPEPEK     NVIHHPKTEAW VDAL+SAKIL KGRDC VGFDGEGL GENE KGV DL  GGVSL ESMVLE
Subjt:  MLCEHLRLSSSSS-----SRIRLFVFFPEPEK---THNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLE

Query:  TSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQ
        TSSSFGSSSSS SLANVP  IKP +EDF  SSLDNAV            ASEIA T+SCS +EN VMSIP+ISES FHDPAAG+HPQN IDFSGY LA +
Subjt:  TSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQ

Query:  PNSFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQV
        PNSFQ QALQFVQAG PVESCLP+++PMASYYP QQPQF+HYQ MPNHMYP+YFLPVGQTQVS PSNLP+QWGL DA TGSLSHA ++PDASPV  L QV
Subjt:  PNSFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQV

Query:  AYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQL
        AYKEV PEP  QNFGA  PLANP+A++  DEV+QQPVSISND A A+SGEVAR RNECNDDD ART IYKSQPPPPLVPSQLQSKA AST LLSDAMAQL
Subjt:  AYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQL

Query:  QMIQIKQ
        QMI+IKQ
Subjt:  QMIQIKQ

A0A6J1E0T6 uncharacterized protein LOC111429779 (e value: 1.6e-182; % identity: 73.36)
Query:  PPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLC
        P S PQ A  AA TKLRLMCSYGGH+TPRPRTKSLSYLGGETRIISVDPTTVNTLS+FISHLLTILP+KPPFSLKYQLPHS LDSLISLSSDDDLQ ML 
Subjt:  PPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLC

Query:  EHLRLSSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETSSSFGSSSSS
         HL LSSS+SSRIRLF+ FPEPEKT NVIHHPKTEAWFVDALKSAKI  KGRD LVGFDG+ L GENEAK VADL NGGVSL ESM+LETSSSF SSSSS
Subjt:  EHLRLSSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETSSSFGSSSSS

Query:  TSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPNSFQQQALQF
                       DFGLSSLDN VKL TTSDSVATL+S           ENPV       +SNFH   +GV+PQNPI FSGYALAS+PNSFQQQAL+ 
Subjt:  TSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPNSFQQQALQF

Query:  VQAGAPVESC-LPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQVAYKEVTPEPR
              V+SC LPAVYPM SYYPVQQPQFVHYQ MP+H+YPVY LPVGQT+VSAPSNLP QW LH+A TGSLSH           PLPQVAYKEVTPEPR
Subjt:  VQAGAPVESC-LPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQVAYKEVTPEPR

Query:  TQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNEC-----NDDDPARTLIYKSQ--PPPPLVPSQLQSKAKASTNLLSDAMAQLQMI
        TQ FGAM       A+KS D VQQQPV+ISND AAA SGEVA T NEC     N+DDP RTLIYKSQ  PPPPLVPSQLQSKA+A+TN+LSDAM+QLQMI
Subjt:  TQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNEC-----NDDDPARTLIYKSQ--PPPPLVPSQLQSKAKASTNLLSDAMAQLQMI

Query:  QIK
        Q K
Subjt:  QIK

SwissProt top hits (hit; e value; % identity; alignment)
No hits found

Arabidopsis top hits (hit; e value; % identity; alignment)
AT2G01190.1 Octicosapeptide/Phox/Bem1p family protein (e value: 5.8e-28; % identity: 29.82)
Query:  PPPRPP-----------SSPQSAT-------------------AAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTIL
        PPP PP           SSP+S T                   +A  +KLR MCSYGGHI PRP  KSL Y+GG+TRI+ VD    ++L + I+ L   L
Subjt:  PPPRPP-----------SSPQSAT-------------------AAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTIL

Query:  PVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSSS----SRIRLFVFFPEPEKTHN----VIHHPKTEAWFVDALKSAKILHKG-------
             F+LKYQLP   LDSLIS+++D+DL  M+ E+ R  S+S+    SR+RLF+F  +PE T +    +    K++ WF++AL SA +L++G       
Subjt:  PVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSSS----SRIRLFVFFPEPEKTHN----VIHHPKTEAWFVDALKSAKILHKG-------

Query:  RDCLVGFDG----EGLTGENEAKGVAD------------------LANGGVS---LPESMVLETSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSL----
         + L+G D        +G+N  +   D                     GG     LP+S +L+TSSSFGS+SSS SLAN+P       E  G+ +L    
Subjt:  RDCLVGFDG----EGLTGENEAKGVAD------------------LANGGVS---LPESMVLETSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSL----

Query:  ---------------------DNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVH-PQNPIDFSGYALASQPNSFQQQALQF
                             D    + +      T+A   AP  + ++       +    E + H   AG   P  P         SQP +   Q    
Subjt:  ---------------------DNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVH-PQNPIDFSGYALASQPNSFQQQALQF

Query:  VQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSN
        +++ +     LP+   ++S   +  P F  +Q    +  P+  +P G T V+   N
Subjt:  VQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSN

AT3G18230.1 Octicosapeptide/Phox/Bem1p family protein (e value: 2.3e-29; % identity: 30.66)
Query:  PQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLR
        P+   A    KLRLMCS+GGHI PRP  KSL+Y GGETRI+ VD     +LS+  S L ++L     F+LKYQLP   LDSL+++++D+DL+ M+ E+ R
Subjt:  PQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLR

Query:  LSSSSSS----RIRLFVFFPEPEKT---HNVIHHPKTEAWFVDALKSAKILHKG-------RDCLVGFD----GE--------GLTGENEAKGVADLANG
         +SS+++    R+RLF+F  + E      +++   K++ WFVDAL  + +L +G        + LV  D    GE         + GEN  +G  DL   
Subjt:  LSSSSSS----RIRLFVFFPEPEKT---HNVIHHPKTEAWFVDALKSAKILHKG-------RDCLVGFD----GE--------GLTGENEAKGVADLANG

Query:  GV---------SLPESMVLETS-SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHD
        GV         S+P+S ++E + SS GSSSSS S +N+P      SE       D  ++ Q    + + + ++    +  SL+ N  M IP         
Subjt:  GV---------SLPESMVLETS-SSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHD

Query:  PAAGVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVH--YQLMPNHMYPVYFLP--VGQTQVSAPSNLPVQWGLH
                             P +    A+ +    AP +   PA     S    +    V   Y+  P  M PV   P  VG   +++P ++     + 
Subjt:  PAAGVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVH--YQLMPNHMYPVYFLP--VGQTQVSAPSNLPVQWGLH

Query:  DAVTGSLSHASVLPDASPVLP-LPQVAYKEVTPE------PRTQNFGAMSP---LANPIAIKSTDEVQQQPVS
         A   S S      D  P LP  P VA  E T        P+T+N  A +    L+ P    + D+ QQQP++
Subjt:  DAVTGSLSHASVLPDASPVLP-LPQVAYKEVTPE------PRTQNFGAMSP---LANPIAIKSTDEVQQQPVS

AT4G05150.1 Octicosapeptide/Phox/Bem1p family protein (e value: 3.6e-14; % identity: 27.57)
Query:  PPPPRPPSSPQSATAA---ALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSD
        PPP     S  S+  +   +  ++R MC++GG I PRP    L Y+GG+ R+++V   T  T ++ +S L   L  K   S+KYQLP+  LD+LIS+S+D
Subjt:  PPPPRPPSSPQSATAA---ALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSD

Query:  DDLQFMLCEHLRLSSSS---SSRIRLFVFFPEPEKTHNVIHHP-----------------KTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGV
        +D++ M+ E+ R++ +    +SR+RLF+F      T NV                       E WF+DAL             V   G G   E     V
Subjt:  DDLQFMLCEHLRLSSSS---SSRIRLFVFFPEPEKTHNVIHHP-----------------KTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGV

Query:  ADLANGGVSLPESMVLETSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFH-DPAA
        + + +    +P+ +       FG       L N      P      L   D   K+Q     V+TL+   +P            S P++  S     P  
Subjt:  ADLANGGVSLPESMVLETSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFH-DPAA

Query:  GVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGS
         + P++P   S      QP    Q      Q+  PV S           Y     Q VHYQ    H  PVY++P      S P N  VQ G H    G+
Subjt:  GVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAPVESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGS

AT5G09620.1 Octicosapeptide/Phox/Bem1p family protein (e value: 4.3e-15; % identity: 44.35)
Query:  KLRLMCSYGGHITPRPRTKSLSYLGGETRIISVD-----PTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SS
        K++LMCSYGG I PRP    L+Y+ G+T+I+SVD     P  V+ LSA  S            S KYQLP   LD+LIS+++D+DL+ M+ E+ RL   S
Subjt:  KLRLMCSYGGHITPRPRTKSLSYLGGETRIISVD-----PTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRL--SS

Query:  SSSSRIRLFVFFPEP
        +  +R+RLF+F   P
Subjt:  SSSSRIRLFVFFPEP

AT5G16220.1 Octicosapeptide/Phox/Bem1p family protein (e value: 1.7e-43; % identity: 32.47)
Query:  KLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSSS---S
        KLR+MC YGG I   P+TKS  Y+GG+TRI+++  +   + ++ +SHL   L +  PF +KYQLP   LDSLIS+ +D+D+Q M+ EH  LSS SS   S
Subjt:  KLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSSS---S

Query:  RIRLFVF------------------------------FPEPEK------THNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADL
        RIRLF+F                                E  K      T  V+ HPKTE WFVDALKS +++   R         G +G  +       
Subjt:  RIRLFVF------------------------------FPEPEK------THNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADL

Query:  ANGGVSLPESMVLETSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTT----SDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAA
         NGG+   ESM+LET+SSFGS+SSS S +N+P  IK   ED   +S      +++     + +V  + S   P++S +    P         SN +   A
Subjt:  ANGGVSLPESMVLETSSSFGSSSSSTSLANVPTAIKPQSEDFGLSSLDNAVKLQTT----SDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAA

Query:  GVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAP-VESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGS
         ++   P+  SGY      N  QQQ +Q +  G P +    P   P  +Y+       V+YQ  P   YP+Y++PV Q        LPV+          
Subjt:  GVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAP-VESCLPAVYPMASYYPVQQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGS

Query:  LSHASVLPDASPVLPLPQVAYKEV-TPEPRTQNFGA-MSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVP
                  S VL   QV    V T  P    F + + PL+ P+   S     +  ++ ++  A   + +V       +D+D A   IYKSQPP P +P
Subjt:  LSHASVLPDASPVLPLPQVAYKEV-TPEPRTQNFGA-MSPLANPIAIKSTDEVQQQPVSISNDGAAAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVP

Query:  SQ
        SQ
Subjt:  SQ
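
The % identity figures reported for the hits above can be roughly cross-checked from the middle row of each alignment block. The Python sketch below assumes the usual BLAST text convention (a letter in the middle row marks an identical residue, "+" marks a conservative substitution, a space marks a mismatch or gap column). The function name midline_stats is hypothetical. For a hit split over several blocks, the query rows and the middle rows would each need to be concatenated first.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block (hypothetical helper)."""
    if not query:
        raise ValueError("empty query row")
    # Trailing spaces are often trimmed from the middle row, so pad it back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("middle row is longer than the query row")
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    length = len(query)                                 # gap columns count towards length
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100.0 * identities / length, 2),
    }


# Final block of one of the alignments above: query row and middle row.
stats = midline_stats("QMIQIKQ", "QMI+IKQ")
print(stats)
```

The value computed this way depends only on the query and middle rows, so it can be used even when the subject row is unavailable.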


Sequences
CDS sequence
ATGGACCCACCGCCGCCACGCCCACCGTCCTCGCCCCAATCCGCCACCGCCGCCGCCCTCACGAAGCTCCGTTTGATGTGCAGCTACGGCGGCCACATAACCCCACGCCC
GCGCACCAAGTCCCTCTCCTATTTGGGCGGCGAAACCCGCATTATCTCCGTCGACCCAACCACCGTCAACACCCTCTCCGCCTTCATTTCTCATCTCCTCACAATTCTTC
CCGTTAAACCCCCCTTTTCCCTCAAGTATCAGCTCCCTCACTCCGCCCTCGACTCTCTCATCTCCCTCTCCTCCGATGACGACCTTCAATTCATGCTCTGCGAGCACCTT
CGCCTCTCCTCTTCCTCTTCCTCTCGCATTAGGCTCTTCGTCTTTTTCCCTGAGCCCGAGAAGACCCATAATGTTATTCATCATCCCAAGACCGAGGCCTGGTTCGTCGA
TGCGCTTAAGAGCGCGAAGATTTTGCACAAGGGGCGCGATTGCTTGGTGGGTTTTGATGGCGAGGGGTTGACTGGAGAGAATGAAGCTAAGGGTGTTGCAGATTTGGCTA
ATGGCGGTGTTTCTTTGCCCGAATCCATGGTCTTGGAGACTAGTTCTTCTTTTGGATCATCGTCTTCTTCGACTTCTTTGGCTAATGTGCCTACTGCCATTAAGCCTCAG
AGTGAGGATTTTGGACTCAGTTCGCTGGATAATGCTGTTAAGCTGCAGACAACTTCTGATTCCGTCGCAACCCTTGCGAGTGAGATCGCCCCTACCAACTCTTGTTCTTT
AGTTGAGAATCCGGTTATGTCTATCCCTATGATATCAGAGAGTAATTTTCATGACCCCGCTGCTGGAGTTCATCCCCAGAACCCCATTGACTTTTCAGGTTATGCACTAG
CTTCCCAACCGAACTCTTTTCAGCAGCAGGCATTGCAGTTTGTTCAAGCAGGTGCACCCGTAGAAAGCTGCCTTCCTGCCGTGTATCCGATGGCTTCTTACTATCCAGTC
CAGCAACCTCAGTTTGTGCATTACCAGCTGATGCCGAACCATATGTATCCTGTCTACTTTTTGCCTGTTGGGCAGACACAGGTTTCAGCCCCTTCCAATCTACCTGTGCA
GTGGGGCTTGCACGATGCTGTGACAGGGAGTTTAAGTCATGCTTCGGTTCTGCCTGATGCGTCTCCTGTTCTTCCTCTTCCTCAAGTAGCTTACAAGGAGGTGACACCCG
AGCCACGCACTCAAAATTTTGGAGCAATGTCACCTCTTGCAAATCCAATTGCTATCAAGTCTACTGATGAAGTTCAGCAGCAGCCTGTAAGCATTTCTAATGATGGTGCA
GCTGCTGTATCTGGTGAAGTCGCACGTACTCGTAATGAATGTAACGACGATGATCCAGCAAGAACCTTGATATACAAATCTCAGCCTCCACCACCCCTGGTTCCTTCTCA
GTTGCAAAGTAAAGCTAAAGCCTCGACGAACCTTCTGTCAGATGCGATGGCACAGCTGCAGATGATCCAAATCAAGCAATGA
mRNA sequence
CTTCCATTCCCATTTGGTTCCCTCCCCTGTTTTGAGTATCCTCCCGTACGCTCCCCTCTAGTCCCCGATCTCAGAAACCCAAATGGACCCACCGCCGCCACGCCCACCGT
CCTCGCCCCAATCCGCCACCGCCGCCGCCCTCACGAAGCTCCGTTTGATGTGCAGCTACGGCGGCCACATAACCCCACGCCCGCGCACCAAGTCCCTCTCCTATTTGGGC
GGCGAAACCCGCATTATCTCCGTCGACCCAACCACCGTCAACACCCTCTCCGCCTTCATTTCTCATCTCCTCACAATTCTTCCCGTTAAACCCCCCTTTTCCCTCAAGTA
TCAGCTCCCTCACTCCGCCCTCGACTCTCTCATCTCCCTCTCCTCCGATGACGACCTTCAATTCATGCTCTGCGAGCACCTTCGCCTCTCCTCTTCCTCTTCCTCTCGCA
TTAGGCTCTTCGTCTTTTTCCCTGAGCCCGAGAAGACCCATAATGTTATTCATCATCCCAAGACCGAGGCCTGGTTCGTCGATGCGCTTAAGAGCGCGAAGATTTTGCAC
AAGGGGCGCGATTGCTTGGTGGGTTTTGATGGCGAGGGGTTGACTGGAGAGAATGAAGCTAAGGGTGTTGCAGATTTGGCTAATGGCGGTGTTTCTTTGCCCGAATCCAT
GGTCTTGGAGACTAGTTCTTCTTTTGGATCATCGTCTTCTTCGACTTCTTTGGCTAATGTGCCTACTGCCATTAAGCCTCAGAGTGAGGATTTTGGACTCAGTTCGCTGG
ATAATGCTGTTAAGCTGCAGACAACTTCTGATTCCGTCGCAACCCTTGCGAGTGAGATCGCCCCTACCAACTCTTGTTCTTTAGTTGAGAATCCGGTTATGTCTATCCCT
ATGATATCAGAGAGTAATTTTCATGACCCCGCTGCTGGAGTTCATCCCCAGAACCCCATTGACTTTTCAGGTTATGCACTAGCTTCCCAACCGAACTCTTTTCAGCAGCA
GGCATTGCAGTTTGTTCAAGCAGGTGCACCCGTAGAAAGCTGCCTTCCTGCCGTGTATCCGATGGCTTCTTACTATCCAGTCCAGCAACCTCAGTTTGTGCATTACCAGC
TGATGCCGAACCATATGTATCCTGTCTACTTTTTGCCTGTTGGGCAGACACAGGTTTCAGCCCCTTCCAATCTACCTGTGCAGTGGGGCTTGCACGATGCTGTGACAGGG
AGTTTAAGTCATGCTTCGGTTCTGCCTGATGCGTCTCCTGTTCTTCCTCTTCCTCAAGTAGCTTACAAGGAGGTGACACCCGAGCCACGCACTCAAAATTTTGGAGCAAT
GTCACCTCTTGCAAATCCAATTGCTATCAAGTCTACTGATGAAGTTCAGCAGCAGCCTGTAAGCATTTCTAATGATGGTGCAGCTGCTGTATCTGGTGAAGTCGCACGTA
CTCGTAATGAATGTAACGACGATGATCCAGCAAGAACCTTGATATACAAATCTCAGCCTCCACCACCCCTGGTTCCTTCTCAGTTGCAAAGTAAAGCTAAAGCCTCGACG
AACCTTCTGTCAGATGCGATGGCACAGCTGCAGATGATCCAAATCAAGCAATGAATTGGAAGCCTGCAAGCACAGTGAGTTAGAGATTTTGATCACCCAGTTTTTGGGAT
TTTTTTTTTCTTTTTATGATTGCCCTTCCTGCATATGAGCTTGTGTTATAGGTGTAAAGACTGAATGCTTGCTGTTCTTATTCTTGTTCTTTTTCTGCTCTGGCTATCTT
GTATTTCATATTTCTTGTTCTTTCTTTGGTCAATATGTTAAATTTGTAAAATCATCTATATATGTTTGTATCAATTTTTTTCCTCTTTCAA
Protein sequence
MDPPPPRPPSSPQSATAAALTKLRLMCSYGGHITPRPRTKSLSYLGGETRIISVDPTTVNTLSAFISHLLTILPVKPPFSLKYQLPHSALDSLISLSSDDDLQFMLCEHL
RLSSSSSSRIRLFVFFPEPEKTHNVIHHPKTEAWFVDALKSAKILHKGRDCLVGFDGEGLTGENEAKGVADLANGGVSLPESMVLETSSSFGSSSSSTSLANVPTAIKPQ
SEDFGLSSLDNAVKLQTTSDSVATLASEIAPTNSCSLVENPVMSIPMISESNFHDPAAGVHPQNPIDFSGYALASQPNSFQQQALQFVQAGAPVESCLPAVYPMASYYPV
QQPQFVHYQLMPNHMYPVYFLPVGQTQVSAPSNLPVQWGLHDAVTGSLSHASVLPDASPVLPLPQVAYKEVTPEPRTQNFGAMSPLANPIAIKSTDEVQQQPVSISNDGA
AAVSGEVARTRNECNDDDPARTLIYKSQPPPPLVPSQLQSKAKASTNLLSDAMAQLQMIQIKQ
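
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal Python sketch that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the function name translate is chosen here for illustration only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 and last 15 bases of the CDS listed above.
print(translate("ATGGACCCACCGCCGCCACGCCCACCG"))
print(translate("CAAATCAAGCAATGA"))
```

The two fragments translate to MDPPPPRPP and QIKQ, which match the start and end of the protein sequence shown above. Passing the full CDS, with line breaks left in, should reproduce the whole protein if the standard-code assumption holds.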