; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1964 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1964
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPB1 domain-containing protein
Genome locationMC04:26404301..26406383
RNA-Seq ExpressionMC04g1964
SyntenyMC04g1964
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000270 - PB1 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051068.1 Phox/Bem1p [Cucumis melo var. makuwa]2.37e-20467.4Show/hide
Query:  MCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSISTASAPSRI
        MCSYGG IT RPRTKSL YLGGETRIISVDP  VNTLS+FISHLLTIL I PPF+LKY LP SALDSLISLSSD DL FM  EHLRLSSS   +S+ SRI
Subjt:  MCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSISTASAPSRI

Query:  RLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLETSSSFGSSSSSASLANV-PLI
        RLF+FFPEPEKP    NVIHHPKTEAW  DAL+SAKILQKGRDC VGFDGEGL+GENE KG+ DLG GG SL ESMVLETSSSFGSSSSSASLANV P I
Subjt:  RLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLETSSSFGSSSSSASLANV-PLI

Query:  KPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPNSFQPQALQFVQAGTPVESC
        K  +ED+  SS    VA   +A     L S+IA T+SCSS+EN V S+PVISES FH+ AAG+  +N  DFSGY    +PN FQ Q LQFVQ   PVESC
Subjt:  KPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPNSFQPQALQFVQAGTPVESC

Query:  LPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLP-----------------MQWGLRDAATGSLSHALV-PDASPVA-LSQVAYKE
        LP ++ M SYYP QQPQF+HYQPMPNHMYP+Y+LPVGQTQVS PSNLP                 MQWGL D AT + +H+LV PDASPV  L QVAYKE
Subjt:  LPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLP-----------------MQWGLRDAATGSLSHALV-PDASPVA-LSQVAYKE

Query:  VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK
        ++PEPH QN GA P LANP++   ADEV+Q PV I ND A     EV R  +ECND+D  RT IYKSQP PPL     QSK   ST LLSDAMAQLQMIK
Subjt:  VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK

Query:  IKQ
        I Q
Subjt:  IKQ

XP_008441273.1 PREDICTED: uncharacterized protein LOC103485455 [Cucumis melo]3.41e-21167.3Show/hide
Query:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
        MDPP PPSP  S  + KLRLMCSYGG IT RPRTKSL YLGGETRIISVDP  VNTLS+FISHLLTIL I PPF+LKY LP SALDSLISLSSD DL FM
Subjt:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM

Query:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
          EHLRLSSS   +S+ SRIRLF+FFPEPEKP    NVIHHPKTEAW  DAL+SAKILQKGRDC VGFDGEGL+GENE KG+ DLG GG SL ESMVLET
Subjt:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET

Query:  SSSFGSSSSSASLANV-PLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRP
        SSSFGSSSSSASLANV P IK  +ED+  SS    VA   +A     L S+IA T+SCSS+EN V S+PVISES FH+ AAG+  +N  DFSGY    +P
Subjt:  SSSFGSSSSSASLANV-PLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRP

Query:  NSFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLP-----------------MQWGLRDAATGSLSH
        N FQ Q LQFVQ   PVESCLP ++ M SYYP QQPQF+HYQPMPNHMYP+Y+LPVGQTQVS PSNLP                 MQWGL D AT + +H
Subjt:  NSFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLP-----------------MQWGLRDAATGSLSH

Query:  ALV-PDASPVA-LSQVAYKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQS
        +LV PDASPV  L QVAYKE++PEPH QN GA P LANP++   ADEV+Q PV I ND A     EV R  +ECND+D  RT IYKSQP PPL     QS
Subjt:  ALV-PDASPVA-LSQVAYKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQS

Query:  KANASTTLLSDAMAQLQMIKIKQ
        K   ST LLSDAMAQLQMIKI Q
Subjt:  KANASTTLLSDAMAQLQMIKIKQ

XP_022133762.1 uncharacterized protein LOC111006259 [Momordica charantia]0.097.61Show/hide
Query:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
        MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
Subjt:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM

Query:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
        LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
Subjt:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET

Query:  SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN
        SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVA            SEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN
Subjt:  SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN

Query:  SFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYKE
        SFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYKE
Subjt:  SFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYKE

Query:  VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK
        VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK
Subjt:  VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK

Query:  IKQ
        IKQ
Subjt:  IKQ

XP_023536931.1 uncharacterized protein LOC111798160 [Cucurbita pepo subsp. pepo]1.91e-19564.91Show/hide
Query:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
        MDP  PP P  +AA  KLRLMCSYGG ITPRPRTKSL YLGGETRIISVDP  VNTLS+FISHLLTIL I PPF+LKYQLP+SALDSLISLSSDDDLQ M
Subjt:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM

Query:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
        LC HL LSSS S     SRIRLF+ FPEPEK    +NVIHHPKTEAW VDAL+SAKI QKGRD  VGFDG+ LIGENE K V DLG GGVSLAESM+LET
Subjt:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET

Query:  SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN
        SSSF SSSSS              DF  SSLDN V     ++    L           S EN V       +S FH   +G++PQN I FSGY LA RPN
Subjt:  SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN

Query:  SFQPQALQFVQAGTPVESC-LPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYK
         FQ QALQFV+A T V+SC LP+++PM SYYP QQPQF+HYQPMP+H+YPLYFLPVGQTQVS PSNLP QW L +AATGSLSH+L          QVAYK
Subjt:  SFQPQALQFVQAGTPVESC-LPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYK

Query:  EVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDD--------LARTQIYKSQPPPP--LVPSQLQSKANASTTLL
        EV PEP  Q FGA       +A++ ADEV+QQPV+ISNDAA A S EVA   NECNDDD          RT IYKSQPPPP  LVPSQLQSKA A+T +L
Subjt:  EVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDD--------LARTQIYKSQPPPP--LVPSQLQSKANASTTLL

Query:  SDAMAQLQMIKIK
        SDAM+QLQMI+ K
Subjt:  SDAMAQLQMIKIK

XP_038884113.1 uncharacterized protein LOC120075037 [Benincasa hispida]6.77e-24574.51Show/hide
Query:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
        MDPP PP    +A + KLRLMCSYGG IT RPRTK+  YLGGETRIISVDP  VNTLSAFISHLLTIL I  PF+LKY LPHSALDSLISLSSDDDL FM
Subjt:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM

Query:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
         CEHLRLSSS   +S+ SRIRLF+F PEPEKP    NVIHHPKTEAW VDAL+SAKILQKGRDC VGFDGEGLIGENE KGV DLG GG SL ESMVLET
Subjt:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET

Query:  SSSFGSSSSSASLANV-PLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRP
        SSSFGSSSSSASLANV P  KP  EDF  SSLDNA      ++    L SEIA T+SCSS+EN VMSIPVISES FH+ AAG+ PQN  DFSGY LA RP
Subjt:  SSSFGSSSSSASLANV-PLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRP

Query:  NSFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALV-PDASPVA-LSQVA
        N FQ Q LQFVQA   VESCLP+++ M SYYP QQPQF+HYQPMPNH+YP+YFLPVGQTQ+S PSNLP+QWGLRDAAT S +H LV PDASPV  L  VA
Subjt:  NSFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALV-PDASPVA-LSQVA

Query:  YKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQ
        YKEV+PEPH QN GA P LANP++LE ADEV+QQPV I NDAA   SGEVA  RNECN+DD ART IYKSQP PPLVPS LQSK  AST LLSDAMAQL 
Subjt:  YKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQ

Query:  MIKIKQ
        MIKI+Q
Subjt:  MIKIKQ

TrEMBL top hitse value%identityAlignment
A0A0A0LMW1 PB1 domain-containing protein2.02e-21366.73Show/hide
Query:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
        MDPP PP    +    KLRLMCSY G IT RPRTKSL YLGGETRIISVDP  VNTLS FISHLLTIL I PPF+LKY LPHSALDSLISLSS DDL FM
Subjt:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM

Query:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
          EHLRLSSS   +S+ SRIRLF+FFPEPEKP    NVIHHPKTEAW  DAL+SAKILQKGRDC VGFDGEGLIGENE KG+ DLG GG SL ESMVLET
Subjt:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET

Query:  SSSFGSSSSSASLANV-PLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRP
        SSSFGSSSSSASLANV P IKP +EDF  SS+         ++    L S+I  T+SCSS+EN V S+PVI+ES FH+ AAG+  +N  DFSGY    RP
Subjt:  SSSFGSSSSSASLANV-PLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRP

Query:  NSFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQ-----------------VSTPSNLPMQWGLRDAATGSLSH
        N FQ Q LQFVQ   PVESCLP ++ M SYYP QQPQF+HYQPMPNHMYP+Y+LPVGQTQ                 VSTPSNLPMQWGL + AT   +H
Subjt:  NSFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQ-----------------VSTPSNLPMQWGLRDAATGSLSH

Query:  ALV-PDASPVA-LSQVAYKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQS
        +LV PDASPV  L QVAYKE++PE H QN GA P LANP +LE ADEV+Q PV I ND A   S EV    +E N+DD  RT IYKSQP PPL     QS
Subjt:  ALV-PDASPVA-LSQVAYKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQS

Query:  KANASTTLLSDAMAQLQMIKIKQ
        K  AST LLSDAMAQLQMIKI Q
Subjt:  KANASTTLLSDAMAQLQMIKIKQ

A0A1S3B318 uncharacterized protein LOC1034854551.65e-21167.3Show/hide
Query:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
        MDPP PPSP  S  + KLRLMCSYGG IT RPRTKSL YLGGETRIISVDP  VNTLS+FISHLLTIL I PPF+LKY LP SALDSLISLSSD DL FM
Subjt:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM

Query:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
          EHLRLSSS   +S+ SRIRLF+FFPEPEKP    NVIHHPKTEAW  DAL+SAKILQKGRDC VGFDGEGL+GENE KG+ DLG GG SL ESMVLET
Subjt:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET

Query:  SSSFGSSSSSASLANV-PLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRP
        SSSFGSSSSSASLANV P IK  +ED+  SS    VA   +A     L S+IA T+SCSS+EN V S+PVISES FH+ AAG+  +N  DFSGY    +P
Subjt:  SSSFGSSSSSASLANV-PLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRP

Query:  NSFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLP-----------------MQWGLRDAATGSLSH
        N FQ Q LQFVQ   PVESCLP ++ M SYYP QQPQF+HYQPMPNHMYP+Y+LPVGQTQVS PSNLP                 MQWGL D AT + +H
Subjt:  NSFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLP-----------------MQWGLRDAATGSLSH

Query:  ALV-PDASPVA-LSQVAYKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQS
        +LV PDASPV  L QVAYKE++PEPH QN GA P LANP++   ADEV+Q PV I ND A     EV R  +ECND+D  RT IYKSQP PPL     QS
Subjt:  ALV-PDASPVA-LSQVAYKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQS

Query:  KANASTTLLSDAMAQLQMIKIKQ
        K   ST LLSDAMAQLQMIKI Q
Subjt:  KANASTTLLSDAMAQLQMIKIKQ

A0A5D3BU91 Phox/Bem1p1.15e-20467.4Show/hide
Query:  MCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSISTASAPSRI
        MCSYGG IT RPRTKSL YLGGETRIISVDP  VNTLS+FISHLLTIL I PPF+LKY LP SALDSLISLSSD DL FM  EHLRLSSS   +S+ SRI
Subjt:  MCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSISTASAPSRI

Query:  RLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLETSSSFGSSSSSASLANV-PLI
        RLF+FFPEPEKP    NVIHHPKTEAW  DAL+SAKILQKGRDC VGFDGEGL+GENE KG+ DLG GG SL ESMVLETSSSFGSSSSSASLANV P I
Subjt:  RLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLETSSSFGSSSSSASLANV-PLI

Query:  KPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPNSFQPQALQFVQAGTPVESC
        K  +ED+  SS    VA   +A     L S+IA T+SCSS+EN V S+PVISES FH+ AAG+  +N  DFSGY    +PN FQ Q LQFVQ   PVESC
Subjt:  KPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPNSFQPQALQFVQAGTPVESC

Query:  LPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLP-----------------MQWGLRDAATGSLSHALV-PDASPVA-LSQVAYKE
        LP ++ M SYYP QQPQF+HYQPMPNHMYP+Y+LPVGQTQVS PSNLP                 MQWGL D AT + +H+LV PDASPV  L QVAYKE
Subjt:  LPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLP-----------------MQWGLRDAATGSLSHALV-PDASPVA-LSQVAYKE

Query:  VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK
        ++PEPH QN GA P LANP++   ADEV+Q PV I ND A     EV R  +ECND+D  RT IYKSQP PPL     QSK   ST LLSDAMAQLQMIK
Subjt:  VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK

Query:  IKQ
        I Q
Subjt:  IKQ

A0A6J1BXN6 uncharacterized protein LOC1110062590.097.61Show/hide
Query:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
        MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
Subjt:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM

Query:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
        LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
Subjt:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET

Query:  SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN
        SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVA            SEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN
Subjt:  SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN

Query:  SFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYKE
        SFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYKE
Subjt:  SFQPQALQFVQAGTPVESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYKE

Query:  VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK
        VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK
Subjt:  VIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIK

Query:  IKQ
        IKQ
Subjt:  IKQ

A0A6J1E0T6 uncharacterized protein LOC1114297796.66e-18963.33Show/hide
Query:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM
        MDP  PP P  +AA  KLRLMCSYGG +TPRPRTKSL YLGGETRIISVDP  VNTLS+FISHLLTIL I PPF+LKYQLPHS LDSLISLSSDDDLQ M
Subjt:  MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFM

Query:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET
        L  HL LSSS S     SRIRLF+ FPEPEK    +NVIHHPKTEAW VDAL+SAKI QKGRD  VGFDG+ LIGENE K V DLG GGVSLAESM+LET
Subjt:  LCEHLRLSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLET

Query:  SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN
        SSSF SSSSS              DF  SSLDN V     ++    L           S EN V       +S FH   +G++PQN I FSGY LA RPN
Subjt:  SSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPN

Query:  SFQPQALQFVQAGTPVESC-LPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYK
        SFQ QAL+ V      +SC LP+++PM SYYP QQPQF+HYQPMP+H+YP+Y LPVGQT+VS PSNLP QW L +AATGSLSH L          QVAYK
Subjt:  SFQPQALQFVQAGTPVESC-LPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYK

Query:  EVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNEC-----NDDDLARTQIYKSQPPPP--LVPSQLQSKANASTTLLSDA
        EV PEP  Q FGA       +A++ AD V+QQPV+ISNDAA A SGEVA   NEC     N+DD  RT IYKSQPPPP  LVPSQLQSKA A+T +LSDA
Subjt:  EVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNEC-----NDDDLARTQIYKSQPPPP--LVPSQLQSKANASTTLLSDA

Query:  MAQLQMIKIK
        M+QLQMI+ K
Subjt:  MAQLQMIKIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01190.1 Octicosapeptide/Phox/Bem1p family protein8.6e-3235.53Show/hide
Query:  SAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSI
        SA S KLR MCSYGG I PRP  KSLCY+GG+TRI+ VD    ++L + I+ L   L     FTLKYQLP   LDSLIS+++D+DL  M+ E+ R + S 
Subjt:  SAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSI

Query:  STASAPSRIRLFVFFPEPEKPTTAQNVIH-HPKTEAWLVDALESAKILQKGRDCS-------VGFDGEGLIGENEGKG---VGDLGT-------------
        S ++ PSR+RLF+F  +PE   +   ++    K++ W ++AL SA +L +G   S       +G D    +  N G      GD G+             
Subjt:  STASAPSRIRLFVFFPEPEKPTTAQNVIH-HPKTEAWLVDALESAKILQKGRDCS-------VGFDGEGLIGENEGKG---VGDLGT-------------

Query:  ------GGVS---LAESMVLETSSSFGSSSSSASLANVPLIKPLNE---------DFAFSSLDNAVARFLIAERTCNLYSEIAATSS--CSSIENAVMSI
              GG     L +S +L+TSSSFGS+SSS SLAN+P I+   E         D     ++   ARF +  +        AA SS     +  A+ + 
Subjt:  ------GGVS---LAESMVLETSSSFGSSSSSASLANVPLIKPLNE---------DFAFSSLDNAVARFLIAERTCNLYSEIAATSS--CSSIENAVMSI

Query:  PVISESYFHDPAAGIHPQNAID----FSGYTLAPRPNSFQPQALQFVQA
        PV + +  ++  A ++  +        +GY   P P S QPQ L   QA
Subjt:  PVISESYFHDPAAGIHPQNAID----FSGYTLAPRPNSFQPQALQFVQA

AT3G18230.1 Octicosapeptide/Phox/Bem1p family protein4.7e-3036.33Show/hide
Query:  PSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLR
        P P  +    KLRLMCS+GG I PRP  KSL Y GGETRI+ VD  A  +LS+  S L ++L     FTLKYQLP   LDSL+++++D+DL+ M+ E+ R
Subjt:  PSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLR

Query:  LSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDG------EGLIGENEGKGV-----------GDLGTGG
         +SS +TA+A  R+RLF+F  + E   T  +++   K++ W VDAL  + +L +G   S   +       E   GE E + +           GDL   G
Subjt:  LSSSISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDG------EGLIGENEGKGV-----------GDLGTGG

Query:  V---------SLAESMVLETS-SSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIP
        V         S+ +S ++E + SS GSSSSS S +N+P I+         S D  +   L      N+ ++       S + N  M IP
Subjt:  V---------SLAESMVLETS-SSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIP

AT5G09620.1 Octicosapeptide/Phox/Bem1p family protein6.0e-1742.31Show/hide
Query:  PSP--PQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVD-----PAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQF
        PSP   Q   + K++LMCSYGG+I PRP    L Y+ G+T+I+SVD     PA V+ LSA  S            + KYQLP   LD+LIS+++D+DL+ 
Subjt:  PSP--PQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVD-----PAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQF

Query:  MLCEHLRLSSSISTASAPSRIRLFVFFPEP
        M+ E+ RL   +  ++ P+R+RLF+F   P
Subjt:  MLCEHLRLSSSISTASAPSRIRLFVFFPEP

AT5G16220.1 Octicosapeptide/Phox/Bem1p family protein1.2e-4633.66Show/hide
Query:  KLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSISTASA
        KLR+MC YGG I   P+TKS  Y+GG+TRI+++  +A  + ++ +SHL   L I+ PF +KYQLP   LDSLIS+ +D+D+Q M+ EH  LSS  S    
Subjt:  KLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSSISTASA

Query:  PSRIRLFVF------------------------------FPEPEKP---TTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEG-KGV
         SRIRLF+F                                E  KP      Q V+ HPKTE W VDAL+S +++Q  R              N G  G 
Subjt:  PSRIRLFVF------------------------------FPEPEKP---TTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEG-KGV

Query:  GDLGTGGVSLAESMVLETSSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMS--IPVISESYFHDPAA
        GD G GG+   ESM+LET+SSFGS+SSS S +N+P IK   ED    ++ N+  +F   E         + TS+ +S    + S  +P  S ++ + P++
Subjt:  GDLGTGGVSLAESMVLETSSSFGSSSSSASLANVPLIKPLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMS--IPVISESYFHDPAA

Query:  GIHPQN-----AIDFSGYTLAPRPNSFQPQALQFVQAGTP-VESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRD
         ++         +  SGY   P  N  Q Q +Q +  G P +    P   P  +Y+       ++YQ  P   YP+Y++PV Q        LP++     
Subjt:  GIHPQN-----AIDFSGYTLAPRPNSFQPQALQFVQAGTP-VESCLPSMFPMASYYPAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRD

Query:  AATGSLSHALVPDASPVALSQVAYKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDD--DLARTQIYKSQPPPP
          +  L++  V   SPV  +      + PE   Q +    PL+ PV            V  S++A +  +   A I N   DD  D+A  QIYKSQPP P
Subjt:  AATGSLSHALVPDASPVALSQVAYKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAAVALSGEVARIRNECNDD--DLARTQIYKSQPPPP

Query:  LVPSQ
         +PSQ
Subjt:  LVPSQ

AT5G64430.1 Octicosapeptide/Phox/Bem1p family protein3.0e-1641.67Show/hide
Query:  DPPQPP----SPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITP----PFTLKYQLPHSALDSLISLSS
        D P PP    +  Q   S K++ MCSYGG+I PRP    L Y+ GET+I+SVD           S L T+           T KYQLP   LD+LIS+++
Subjt:  DPPQPP----SPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITP----PFTLKYQLPHSALDSLISLSS

Query:  DDDLQFMLCEHLRLSSSISTASAPSRIRLFVF
        DDDL+ M+ E+ RL   +  +S P+R+RLF+F
Subjt:  DDDLQFMLCEHLRLSSSISTASAPSRIRLFVF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCCACCTCAGCCACCGTCACCGCCACAATCTGCCGCCTCGATCAAGCTCCGGTTGATGTGCAGCTACGGCGGCCGGATAACCCCACGGCCGCGCACCAAGTCCCT
CTGCTATTTGGGCGGCGAAACCCGCATAATCTCCGTCGACCCAGCCGCAGTCAACACCCTCTCCGCCTTCATCTCCCATCTCCTCACCATCCTCGCCATTACACCTCCCT
TCACCCTCAAGTATCAGCTCCCTCACTCCGCCCTCGACTCCTTAATCTCCCTCTCCTCCGACGACGACCTCCAATTCATGCTCTGCGAGCACCTCCGCCTCTCCTCCTCC
ATTTCCACCGCTTCTGCTCCCTCTCGCATTCGGCTCTTCGTCTTTTTTCCCGAGCCGGAGAAGCCCACGACGGCCCAAAACGTTATTCATCACCCGAAGACCGAGGCCTG
GTTAGTCGATGCGCTTGAGAGTGCCAAGATTCTGCAGAAGGGCCGCGATTGCTCGGTGGGATTTGATGGGGAGGGTCTGATTGGAGAGAATGAAGGAAAGGGTGTTGGAG
ATTTGGGTACTGGGGGTGTTTCCTTGGCGGAATCAATGGTTCTGGAGACCAGTTCCTCTTTTGGATCATCTTCTTCTTCAGCTTCTTTGGCTAACGTGCCTCTCATTAAA
CCTCTGAATGAAGACTTTGCATTCAGTTCGCTGGATAATGCTGTCGCAAGATTTCTAATAGCTGAAAGAACTTGTAATCTTTACAGCGAGATCGCCGCCACCAGCTCTTG
TTCTTCAATTGAGAATGCGGTTATGTCTATCCCTGTGATATCAGAGAGTTACTTTCATGACCCCGCTGCTGGAATTCATCCCCAGAACGCCATTGATTTTTCAGGCTATA
CACTAGCTCCCCGACCAAACTCTTTTCAGCCGCAGGCATTGCAGTTTGTTCAAGCAGGCACTCCGGTAGAAAGCTGCCTTCCTTCTATGTTTCCAATGGCTTCTTACTAT
CCAGCCCAGCAGCCTCAGTTTCTCCATTACCAGCCGATGCCGAACCATATGTATCCTCTCTACTTTTTGCCTGTTGGACAGACACAGGTTTCAACCCCTTCCAACCTACC
TATGCAGTGGGGCTTGCGTGACGCTGCAACTGGGAGTTTGAGCCACGCTTTGGTTCCCGATGCTTCTCCTGTTGCTCTTTCTCAAGTAGCTTACAAGGAGGTGATTCCTG
AGCCACACCCACAGAATTTTGGAGCGACACCACCTCTTGCAAATCCAGTTGCTTTGGAGCCTGCTGATGAAGTTCGGCAGCAACCTGTGAGCATTTCTAATGATGCTGCA
GTTGCTCTATCTGGTGAAGTTGCTCGTATTCGTAATGAATGCAACGACGATGATCTTGCAAGGACTCAAATATACAAATCTCAGCCTCCACCACCCCTGGTTCCTTCCCA
GTTGCAAAGTAAAGCTAATGCCTCGACGACCCTTCTGTCAGATGCGATGGCTCAGCTACAGATGATCAAAATCAAACAA
mRNA sequenceShow/hide mRNA sequence
GAGAAAACAGAAGGAAGGGAAAAAGAAAAATCAAAGCAAACCGAACGAAAAGAAAAAGAAGAAAGAAATCTAAAATCAGGGTAAAGGCTCCCCTTCCATTCCGGTCCCCT
GTCTTCTGAGTATTTTATTTTGGCGTACACTCCCTCTAGCCAGCGCCGATCTTTCCCTCTCACAAACCCAAATGGACCCACCTCAGCCACCGTCACCGCCACAATCTGCC
GCCTCGATCAAGCTCCGGTTGATGTGCAGCTACGGCGGCCGGATAACCCCACGGCCGCGCACCAAGTCCCTCTGCTATTTGGGCGGCGAAACCCGCATAATCTCCGTCGA
CCCAGCCGCAGTCAACACCCTCTCCGCCTTCATCTCCCATCTCCTCACCATCCTCGCCATTACACCTCCCTTCACCCTCAAGTATCAGCTCCCTCACTCCGCCCTCGACT
CCTTAATCTCCCTCTCCTCCGACGACGACCTCCAATTCATGCTCTGCGAGCACCTCCGCCTCTCCTCCTCCATTTCCACCGCTTCTGCTCCCTCTCGCATTCGGCTCTTC
GTCTTTTTTCCCGAGCCGGAGAAGCCCACGACGGCCCAAAACGTTATTCATCACCCGAAGACCGAGGCCTGGTTAGTCGATGCGCTTGAGAGTGCCAAGATTCTGCAGAA
GGGCCGCGATTGCTCGGTGGGATTTGATGGGGAGGGTCTGATTGGAGAGAATGAAGGAAAGGGTGTTGGAGATTTGGGTACTGGGGGTGTTTCCTTGGCGGAATCAATGG
TTCTGGAGACCAGTTCCTCTTTTGGATCATCTTCTTCTTCAGCTTCTTTGGCTAACGTGCCTCTCATTAAACCTCTGAATGAAGACTTTGCATTCAGTTCGCTGGATAAT
GCTGTCGCAAGATTTCTAATAGCTGAAAGAACTTGTAATCTTTACAGCGAGATCGCCGCCACCAGCTCTTGTTCTTCAATTGAGAATGCGGTTATGTCTATCCCTGTGAT
ATCAGAGAGTTACTTTCATGACCCCGCTGCTGGAATTCATCCCCAGAACGCCATTGATTTTTCAGGCTATACACTAGCTCCCCGACCAAACTCTTTTCAGCCGCAGGCAT
TGCAGTTTGTTCAAGCAGGCACTCCGGTAGAAAGCTGCCTTCCTTCTATGTTTCCAATGGCTTCTTACTATCCAGCCCAGCAGCCTCAGTTTCTCCATTACCAGCCGATG
CCGAACCATATGTATCCTCTCTACTTTTTGCCTGTTGGACAGACACAGGTTTCAACCCCTTCCAACCTACCTATGCAGTGGGGCTTGCGTGACGCTGCAACTGGGAGTTT
GAGCCACGCTTTGGTTCCCGATGCTTCTCCTGTTGCTCTTTCTCAAGTAGCTTACAAGGAGGTGATTCCTGAGCCACACCCACAGAATTTTGGAGCGACACCACCTCTTG
CAAATCCAGTTGCTTTGGAGCCTGCTGATGAAGTTCGGCAGCAACCTGTGAGCATTTCTAATGATGCTGCAGTTGCTCTATCTGGTGAAGTTGCTCGTATTCGTAATGAA
TGCAACGACGATGATCTTGCAAGGACTCAAATATACAAATCTCAGCCTCCACCACCCCTGGTTCCTTCCCAGTTGCAAAGTAAAGCTAATGCCTCGACGACCCTTCTGTC
AGATGCGATGGCTCAGCTACAGATGATCAAAATCAAACAA
Protein sequenceShow/hide protein sequence
MDPPQPPSPPQSAASIKLRLMCSYGGRITPRPRTKSLCYLGGETRIISVDPAAVNTLSAFISHLLTILAITPPFTLKYQLPHSALDSLISLSSDDDLQFMLCEHLRLSSS
ISTASAPSRIRLFVFFPEPEKPTTAQNVIHHPKTEAWLVDALESAKILQKGRDCSVGFDGEGLIGENEGKGVGDLGTGGVSLAESMVLETSSSFGSSSSSASLANVPLIK
PLNEDFAFSSLDNAVARFLIAERTCNLYSEIAATSSCSSIENAVMSIPVISESYFHDPAAGIHPQNAIDFSGYTLAPRPNSFQPQALQFVQAGTPVESCLPSMFPMASYY
PAQQPQFLHYQPMPNHMYPLYFLPVGQTQVSTPSNLPMQWGLRDAATGSLSHALVPDASPVALSQVAYKEVIPEPHPQNFGATPPLANPVALEPADEVRQQPVSISNDAA
VALSGEVARIRNECNDDDLARTQIYKSQPPPPLVPSQLQSKANASTTLLSDAMAQLQMIKIKQ