CuGenDBv2

CSPI01G06230 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G06230
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: FCP1 homology domain-containing protein
Genome location: Chr1:3907012..3911854
RNA-Seq Expression: CSPI01G06230
Synteny: CSPI01G06230
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_008451628.1 PREDICTED: uncharacterized protein LOC103492827 isoform X1 [Cucumis melo] (e value: 2.3e-242; % identity: 91.77)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDS G+SEDPGGLRDHEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEASTSKELIMSTPSCVPP LENAETAKEEVVCFTASGETSS VNAMAEEK P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP

Query:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKL+VLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIF
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNP   AIF
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIF

XP_008451760.1 PREDICTED: uncharacterized protein LOC103492827 isoform X2 [Cucumis melo] (e value: 8.3e-293; % identity: 93.16)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDS G+SEDPGGLRDHEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEASTSKELIMSTPSCVPP LENAETAKEEVVCFTASGETSS VNAMAEEK P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP

Query:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKL+VLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE

Query:  QNRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
        QNRFGQRPITEKNASWKFYRRIIYFVERKNDQEN+N FQWN
Subjt:  QNRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN

XP_011650222.1 uncharacterized protein LOC101203219 isoform X1 [Cucumis sativus] (e value: 7.6e-262; % identity: 98.26)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
        DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLR HEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL

Query:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTPP
        EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEAST KELIMSTPSCVPPGLENAETAKEEVVCFT SGETSSVVNAMAEE+TPP
Subjt:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTPP

Query:  LVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
        LVLDTSEKGDSIGSTTKKL+VLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
Subjt:  LVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD

Query:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIF
        QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNP   AIF
Subjt:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIF

XP_031738119.1 uncharacterized protein LOC101203219 isoform X2 [Cucumis sativus] (e value: 0.0e+00; % identity: 99.07)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
        DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLR HEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL

Query:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTPP
        EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEAST KELIMSTPSCVPPGLENAETAKEEVVCFT SGETSSVVNAMAEE+TPP
Subjt:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTPP

Query:  LVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
        LVLDTSEKGDSIGSTTKKL+VLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
Subjt:  LVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD

Query:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQ
        QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQ
Subjt:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQ

Query:  NRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
        NRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
Subjt:  NRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN

XP_038894827.1 uncharacterized protein LOC120083233 isoform X1 [Benincasa hispida] (e value: 2.1e-264; % identity: 85.56)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDD SEGNN GSV SG +   SMDKIL E DP+ DA  ++CSKLESETGK LPEICN KGNVH KEHNDD+KLSKD 
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
        DTE++NING  NLILN  EVK+NVA YSVE+EEPSSMNAYKEDSGISEDPGG+ DH+  D GNID+V QELSKEMIDV+KD HSREK SDP Y LPCNE 
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL

Query:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTPP
        E +GDGSLKS +VEQIND FGNNASEKIVEG VEE SVCCS  EHDDE STSKELIMSTP CVPP LENAET KEE VCF+ASGETSS V+A+AEEKTP 
Subjt:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTPP

Query:  LVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
        LVLDTSEKGDSIG + KKL+VLDVNGLLADFI YVPPGYKPDI+I QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM D+REKLLFCWD
Subjt:  LVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD

Query:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQ
        QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQ
Subjt:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQ

Query:  NRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
        NRFGQRPITEKN SWKFYRRIIYFVERKNDQE+ NSF+WN
Subjt:  NRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
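
The % identity values reported for each hit can be related to the alignment midline: in BLAST-style output, a letter on the midline marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below is a minimal illustration in Python, assuming that convention; the function name midline_stats is hypothetical and not part of CuGenDB. It counts identities for the last alignment segment of the Benincasa hispida hit only, so its result describes that segment and is not expected to reproduce the whole-alignment figure of 85.56.

```python
def midline_stats(query, midline):
    """Return (identical, positives, length) for one alignment segment.

    Assumes BLAST-style midline characters: a letter for an identical
    residue, '+' for a conservative substitution, ' ' otherwise.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for m in midline if m not in " +")
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# Final segment of the XP_038894827.1 alignment, copied from the record.
query = "NRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN"
midline = "NRFGQRPITEKN SWKFYRRIIYFVERKNDQE+ NSF+WN"

identical, positives, length = midline_stats(query, midline)
print(f"identities: {identical}/{length} ({100 * identical / length:.2f}%)")
print(f"positives:  {positives}/{length} ({100 * positives / length:.2f}%)")
```

For this segment the count is 36 identical positions out of 40. Summing the counts over every segment of a hit and dividing by the total alignment length gives the per-hit percentage.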

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LSV7 FCP1 homology domain-containing protein (e value: 0.0e+00; % identity: 99.07)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
        DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLR HEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL

Query:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTPP
        EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEAST KELIMSTPSCVPPGLENAETAKEEVVCFT SGETSSVVNAMAEE+TPP
Subjt:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTPP

Query:  LVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
        LVLDTSEKGDSIGSTTKKL+VLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
Subjt:  LVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD

Query:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQ
        QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQ
Subjt:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQ

Query:  NRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
        NRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
Subjt:  NRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN

A0A1S3BRN0 uncharacterized protein LOC103492827 isoform X2 (e value: 4.0e-293; % identity: 93.16)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDS G+SEDPGGLRDHEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEASTSKELIMSTPSCVPP LENAETAKEEVVCFTASGETSS VNAMAEEK P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP

Query:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKL+VLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE

Query:  QNRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
        QNRFGQRPITEKNASWKFYRRIIYFVERKNDQEN+N FQWN
Subjt:  QNRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN

A0A1S3BSS1 uncharacterized protein LOC103492827 isoform X1 (e value: 1.1e-242; % identity: 91.77)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDS G+SEDPGGLRDHEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEASTSKELIMSTPSCVPP LENAETAKEEVVCFTASGETSS VNAMAEEK P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP

Query:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKL+VLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIF
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNP   AIF
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIF

A0A5D3BIK1 Putative C-terminal domain small phosphatase isoform X2 (e value: 4.0e-293; % identity: 93.16)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDS G+SEDPGGLRDHEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDS-GISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEASTSKELIMSTPSCVPP LENAETAKEEVVCFTASGETSS VNAMAEEK P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTP

Query:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKL+VLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE

Query:  QNRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
        QNRFGQRPITEKNASWKFYRRIIYFVERKNDQEN+N FQWN
Subjt:  QNRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN

A0A6J1C778 uncharacterized protein LOC111008876 (e value: 1.8e-237; % identity: 76.97)
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS  DT+  LEHK KKRKQEQFD A E NN GSVCSG E ASSMD IL E DP+PDAT+++CSKLESETG++ P ICN +GNVH KEHNDD + SKD 
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDV----HSREKLSDPDYPLP
        DTE+ENINGS +LILN  +VKQNVA YS+EMEEPSSMNAYKEDS +SEDP   RDH+  D GN+  V QEL+KEMID  KD     HS EK SD DY  P
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDV----HSREKLSDPDYPLP

Query:  CNELEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFG-EHDDEASTSKELIMSTPSCVPPGLENAETAK--EEVVCFTASGETSSVVNAM
        CNE EY+ D SLK+ D+EQIN   GNN SEK V+  +EEASVCC+ G E+DDE STSKE I+STPSC+PP  ENA+T K  EEVVCF+ASGETS  ++A+
Subjt:  CNELEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFG-EHDDEASTSKELIMSTPSCVPPGLENAETAK--EEVVCFTASGETSSVVNAM

Query:  AEEKTPPLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYRE
         EE  P LVLDTSEKGDSIGS+ KKL+VLDVNGLLADFICYVP GYKPDIII QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM D ++
Subjt:  AEEKTPPLVLDTSEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYRE

Query:  KLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAEN
        KLLFCWDQSHCTDTTFSTVEN HKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRD  DTSLGPGGDLRV+LEGLS+AEN
Subjt:  KLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAEN

Query:  VQKYVEQNRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
        VQKYVEQN FGQRPITEKN SWKFYRRIIYFVER+ND+++ NSF+WN
Subjt:  VQKYVEQNRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN

SwissProt top hits (e value, % identity, alignment)
O14595 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 (e value: 4.7e-04; % identity: 28.22)
Query:  QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVID-------FLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNAS
        Q  V KRP+ D+F++   E FE  ++++   +  D V D       F  R +RE  +F                  H+   +K++ +L + L+       
Subjt:  QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVID-------FLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNAS

Query:  NTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQNR
         TL+LD+SP   + +P N    PV   F D  DT L    +L    E LS AE+V   + Q R
Subjt:  NTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQNR

O94336 Uncharacterized FCP1 homology domain-containing protein C1271.03c (e value: 6.9e-08; % identity: 26.01)
Query:  PLVLDTSEKGDSIGST-TKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFK-------RPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDY
        P +   S+     G+T  +KL++LD+NG L   +C      +   +  +K+V++       RP   +F+K+ F  F V V+SS    NV  ++  +M + 
Subjt:  PLVLDTSEKGDSIGST-TKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFK-------RPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDY

Query:  REK-LLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNPAN
        ++K L+ CW +    D   +  +   K    K +  +W+ +      KP  ++  NT+++DDS  K   +P N
Subjt:  REK-LLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNPAN

Q9XYL0 Probable C-terminal domain small phosphatase (e value: 2.5e-10; % identity: 29.02)
Query:  KLIVLDVNGLLADFICYVPPGYKPDII--------IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTF
        K +VLD++  L        P + PD I        I Q  V KRPF DDF++   E+FE+ V+++   +  D V+DFL  D    + +   +  C     
Subjt:  KLIVLDVNGLLADFICYVPPGYKPDII--------IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTF

Query:  STVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQNR
            + HK   +K++ +L + LK       +T+++D+SP   L +P N    P+   F D DD  L    DL   L+ L   E+V+  ++++R
Subjt:  STVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQNR

Arabidopsis top hits (e value, % identity, alignment)
AT2G36540.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein (e value: 1.9e-37; % identity: 35.02)
Query:  MAEEKTPPLVL------DTSEKGDSIGSTT-----------------KKLIVLDVNGLLADFI-----CYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCF
        MAEEK    +L      D   +GD++   T                 KKL+VL ++GLL   +        P    PD       V+KRPF ++F+KFC 
Subjt:  MAEEKTPPLVL------DTSEKGDSIGSTT-----------------KKLIVLDVNGLLADFI-----CYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCF

Query:  ERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVT
        ERFEVG+WSS        ++  L       +L       CTD+ + T+EN++KPL  K++ K++K  K   F+ASNT+ +DD P+KAL NP NT +FP++
Subjt:  ERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVT

Query:  YRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQNRFGQRPITEKNASWKFYRRI
        Y   +  D  L P G+L  +LEGL+ + +VQ Y++ + FG+  I   +  W FY  +
Subjt:  YRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQNRFGQRPITEKNASWKFYRRI

AT2G36550.1 CONTAINS InterPro DOMAIN/s: NLI interacting factor (InterPro:IPR004274) (e value: 3.8e-25; % identity: 40.98)
Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE
        DQ  CTD+ + T+EN  KPL  K++ K+++  K   F+ASNT+ +++ P+KAL NP NT +FP++Y   DT D  L P G+   +L+GL+ + +VQ Y++
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVE

Query:  QNRFGQRPITEKNASWKFYRRI
        ++ FGQ  I   +  W +YRR+
Subjt:  QNRFGQRPITEKNASWKFYRRI

AT3G29760.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein (e value: 2.2e-41; % identity: 37.5)
Query:  SLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKE-EVVCFTASGETSSVVNAMAEEKTPPLVLDT
        S   + V + ND+   +  E I +   +E SV  +   +D+       ++ +  SCV  G E  E  +E  V+    S +  SVV     E     V+  
Subjt:  SLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKE-EVVCFTASGETSSVVNAMAEEKTPPLVLDT

Query:  SEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCT
           G +     KKL+VLD+NGLLAD +  +      DI I ++A+FKRPFCD+F++FCF++FEVG+WSSR + NV  + +FL+ D + KLLFCWD S+C 
Subjt:  SEKGDSIGSTTKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCT

Query:  DTTFSTVENKHKPLVLKEIKKLWKYLKPR------EFNASNTLLLDDSPHKALCNPANTAIFPV
         T+  ++EN++K +V K++ +LW+   PR      ++N +NT+LLDDSP+KAL NP  + I  +
Subjt:  DTTFSTVENKHKPLVLKEIKKLWKYLKPR------EFNASNTLLLDDSPHKALCNPANTAIFPV

AT4G26190.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein (e value: 4.1e-56; % identity: 42.97)
Query:  ETSSVVNAMAEEKTPPLVLDTSEKGDSIGST-----TKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRN
        ETS   + + +       + +SE GD    T     T+KL++ D+NG+LAD +      + PD  +  ++VF+RPF   F+ FCFERF+V +WSSR R  
Subjt:  ETSSVVNAMAEEKTPPLVLDTSEKGDSIGST-----TKKLIVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRN

Query:  VDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDT
        +D +I+ +M+++   LLFC+DQ+ CT T F T E K KPL LK+++++W ++        R+++ +NTLL+DDSP KALCNP +T IFP  Y++ +  D+
Subjt:  VDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDT

Query:  SLGPGGDLRVFLEGLSMAENVQKYVEQNRFGQRPITEKNASWKFYRRII
        +LGP G+LR +LE L+ AENVQK+V +N FGQ  ITE + SW+FY + +
Subjt:  SLGPGGDLRVFLEGLSMAENVQKYVEQNRFGQRPITEKNASWKFYRRII


Sequences
CDS sequence
ATGGATATTTCAGCATGTGATACAAACGAAGGGTTGGAGCATAAAATGAAAAAGAGAAAGCAAGAACAGTTTGATGATGCTTCTGAAGGAAACAATACGGGTAGTGTTTG
TTCTGGTTTTGAATTTGCATCTTCAATGGATAAAATTCTGTTTGAAAAAGATCCTTCTCCAGATGCCACGTTAGTTATGTGCTCGAAGTTGGAGTCTGAAACAGGAAAAA
ATCTTCCAGAGATATGTAATCTAAAAGGGAATGTTCATGAAAAGGAGCATAATGATGATGAAAAATTGTCAAAGGATACGGATACAGAGAATGAAAATATTAATGGTTCT
TATAATCTTATCCTGAATGCAGCAGAAGTAAAACAAAATGTTGCTGGATATAGTGTTGAAATGGAAGAGCCAAGTTCTATGAATGCTTACAAAGAAGACTCTGGGATCTC
TGAAGATCCAGGTGGTTTGAGAGATCATGAGGTATCTGATCAAGGAAACATTGACACTGTCGCCCAAGAACTGAGCAAGGAGATGATAGATGTGAAGAAGGATGTTCATT
CTAGAGAAAAACTCTCTGACCCTGATTATCCTTTGCCATGTAATGAGCTGGAATACGATGGGGATGGTTCTTTGAAAAGTTTAGATGTAGAACAAATAAATGACACATTT
GGTAATAATGCTTCAGAAAAGATCGTGGAAGGCCCTGTGGAGGAAGCTTCTGTTTGCTGTTCATTTGGTGAGCATGATGATGAAGCTTCAACGAGCAAGGAACTGATTAT
GTCAACTCCTTCCTGTGTGCCTCCAGGACTGGAAAATGCTGAAACTGCCAAGGAAGAAGTTGTATGTTTCACTGCTTCTGGTGAGACAAGCAGTGTTGTTAATGCTATGG
CTGAAGAGAAAACTCCCCCGCTGGTTTTGGATACTTCAGAGAAAGGAGATTCAATTGGATCTACAACGAAGAAGCTTATTGTTCTTGATGTAAATGGACTGCTTGCAGAT
TTTATTTGTTACGTTCCACCTGGATATAAGCCAGACATTATAATAAGACAAAAAGCAGTGTTCAAGAGGCCATTTTGTGATGATTTTATTAAGTTTTGTTTTGAAAGATT
CGAGGTGGGTGTTTGGTCGTCAAGAACTCGGAGAAATGTGGACATGGTGATAGATTTTCTAATGAGAGACTATAGAGAAAAATTACTATTTTGCTGGGATCAATCACACT
GTACTGACACCACGTTCTCTACCGTAGAGAACAAGCACAAGCCTTTAGTCTTAAAGGAAATAAAAAAACTGTGGAAATACCTCAAGCCACGAGAGTTCAATGCATCAAAC
ACCCTACTGTTGGATGATTCCCCACACAAGGCATTGTGCAATCCGGCAAACACTGCAATATTTCCTGTAACATATCGGTTTAGGGATACTGACGATACGTCTTTAGGACC
GGGAGGCGATCTTCGGGTTTTTCTGGAAGGTTTATCGATGGCAGAAAACGTTCAAAAATACGTTGAGCAGAATCGGTTTGGCCAACGTCCCATTACAGAAAAGAACGCGT
CTTGGAAGTTTTATAGACGGATCATATATTTTGTTGAGCGAAAAAATGATCAGGAGAATAACAATTCTTTCCAATGGAACTGA
mRNA sequence
GAAAAGCATATTATATATTTGATAATGAAAAGAAAAAGAAAAGAAAAAGAAAAAAGGAAAAAAAATCAGAGGAGAAGATTGATGGTCAGTGAGCCGCCGATGCGCAAATG
GGCTGCCCGCGAGAAGAACGACGGCGCTACATCCAACAAAAGGAGAGAAATTTTGAGCTGCCATCCAACGACATTCAATGCCACACTCTTCTTTTTTGTCCTGCCAATAG
TTACAGTTCATTGCTTGTTCATCTGATCCCATGTTCGATCTGTAGGAGAAAATTCCTTCAAGGAAACACTTGAAGAGTTCTCAAAGAACTTGACCTTTTGAATTTATGAG
GATATAGAGTGGGATTTCTGCAAAATATCTGGAAATATTGGTCATGGATATTTCAGCATGTGATACAAACGAAGGGTTGGAGCATAAAATGAAAAAGAGAAAGCAAGAAC
AGTTTGATGATGCTTCTGAAGGAAACAATACGGGTAGTGTTTGTTCTGGTTTTGAATTTGCATCTTCAATGGATAAAATTCTGTTTGAAAAAGATCCTTCTCCAGATGCC
ACGTTAGTTATGTGCTCGAAGTTGGAGTCTGAAACAGGAAAAAATCTTCCAGAGATATGTAATCTAAAAGGGAATGTTCATGAAAAGGAGCATAATGATGATGAAAAATT
GTCAAAGGATACGGATACAGAGAATGAAAATATTAATGGTTCTTATAATCTTATCCTGAATGCAGCAGAAGTAAAACAAAATGTTGCTGGATATAGTGTTGAAATGGAAG
AGCCAAGTTCTATGAATGCTTACAAAGAAGACTCTGGGATCTCTGAAGATCCAGGTGGTTTGAGAGATCATGAGGTATCTGATCAAGGAAACATTGACACTGTCGCCCAA
GAACTGAGCAAGGAGATGATAGATGTGAAGAAGGATGTTCATTCTAGAGAAAAACTCTCTGACCCTGATTATCCTTTGCCATGTAATGAGCTGGAATACGATGGGGATGG
TTCTTTGAAAAGTTTAGATGTAGAACAAATAAATGACACATTTGGTAATAATGCTTCAGAAAAGATCGTGGAAGGCCCTGTGGAGGAAGCTTCTGTTTGCTGTTCATTTG
GTGAGCATGATGATGAAGCTTCAACGAGCAAGGAACTGATTATGTCAACTCCTTCCTGTGTGCCTCCAGGACTGGAAAATGCTGAAACTGCCAAGGAAGAAGTTGTATGT
TTCACTGCTTCTGGTGAGACAAGCAGTGTTGTTAATGCTATGGCTGAAGAGAAAACTCCCCCGCTGGTTTTGGATACTTCAGAGAAAGGAGATTCAATTGGATCTACAAC
GAAGAAGCTTATTGTTCTTGATGTAAATGGACTGCTTGCAGATTTTATTTGTTACGTTCCACCTGGATATAAGCCAGACATTATAATAAGACAAAAAGCAGTGTTCAAGA
GGCCATTTTGTGATGATTTTATTAAGTTTTGTTTTGAAAGATTCGAGGTGGGTGTTTGGTCGTCAAGAACTCGGAGAAATGTGGACATGGTGATAGATTTTCTAATGAGA
GACTATAGAGAAAAATTACTATTTTGCTGGGATCAATCACACTGTACTGACACCACGTTCTCTACCGTAGAGAACAAGCACAAGCCTTTAGTCTTAAAGGAAATAAAAAA
ACTGTGGAAATACCTCAAGCCACGAGAGTTCAATGCATCAAACACCCTACTGTTGGATGATTCCCCACACAAGGCATTGTGCAATCCGGCAAACACTGCAATATTTCCTG
TAACATATCGGTTTAGGGATACTGACGATACGTCTTTAGGACCGGGAGGCGATCTTCGGGTTTTTCTGGAAGGTTTATCGATGGCAGAAAACGTTCAAAAATACGTTGAG
CAGAATCGGTTTGGCCAACGTCCCATTACAGAAAAGAACGCGTCTTGGAAGTTTTATAGACGGATCATATATTTTGTTGAGCGAAAAAATGATCAGGAGAATAACAATTC
TTTCCAATGGAACTGAAAAGGTAAACTTGAGAGGCATTTTCATGCATCTTTTTGGTATATCTTCTTGTA
Protein sequence
MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDTDTENENINGS
YNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRDHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNELEYDGDGSLKSLDVEQINDTF
GNNASEKIVEGPVEEASVCCSFGEHDDEASTSKELIMSTPSCVPPGLENAETAKEEVVCFTASGETSSVVNAMAEEKTPPLVLDTSEKGDSIGSTTKKLIVLDVNGLLAD
FICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASN
TLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQKYVEQNRFGQRPITEKNASWKFYRRIIYFVERKNDQENNNSFQWN
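
The CDS and protein sequences above can be checked against each other by translating the CDS with the standard genetic code. The sketch below is a minimal Python illustration, assuming the standard nuclear code (NCBI translation table 1); the function name translate is hypothetical. To keep the example short it translates only the first 30 and the last 12 nucleotides of the CDS, copied from the record, rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGATATTTCAGCATGTGATACAAACGAA"  # first 30 nt of the CDS
cds_end = "CAATGGAACTGA"                      # last 12 nt, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MDISACDTNE and QWN, which match the first ten and last three residues of the protein sequence. Passing the full CDS, with its lines joined, to the same function would allow the whole protein to be compared.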