CuGenDBv2

Cucsat.G14133 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G14133
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: FCP1 homology domain-containing protein
Genome location: ctg1869:3060376..3064109
RNA-Seq Expression: Cucsat.G14133
Synteny: Cucsat.G14133
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0004252 - serine-type endopeptidase activity (molecular function)
InterPro domains:
  IPR004274 - FCP1 homology domain
  IPR023214 - HAD superfamily
  IPR036412 - HAD-like superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_008451628.1 PREDICTED: uncharacterized protein LOC103492827 isoform X1 [Cucumis melo] | e value: 0.0 | % identity: 91.22
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDSG +SEDPGGLR HEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEAST KELIMSTPSCVPP LENAETAKEEVVCFT SGETSS VNAMAEE+ P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP

Query:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFID
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVY WQK FKN LSRIGLANVPLQKRTRLG+FID
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFID

Query:  GSYILLSEKMIRRITILSNGTEKVNLRGIFMHLFGISSCMNGNSVTC
        GSYILLSEK IRRI I+SNGTEKVNLRGIFMHLFGISSCMN  S TC
Subjt:  GSYILLSEKMIRRITILSNGTEKVNLRGIFMHLFGISSCMNGNSVTC

XP_008451760.1 PREDICTED: uncharacterized protein LOC103492827 isoform X2 [Cucumis melo] | e value: 7.87e-303 | % identity: 91.13
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDSG +SEDPGGLR HEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEAST KELIMSTPSCVPP LENAETAKEEVVCFT SGETSS VNAMAEE+ P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP

Query:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNP   AIF
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF

XP_011650222.1 uncharacterized protein LOC101203219 isoform X1 [Cucumis sativus] | e value: 0.0 | % identity: 100
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
        DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL

Query:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP
        EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP
Subjt:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP

Query:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
        LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
Subjt:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD

Query:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFIDG
        QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFIDG
Subjt:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFIDG

Query:  SYILLSEKMIRRITILSNGTEKVNLRGIFMHLFGISSCMNGNSVTC
        SYILLSEKMIRRITILSNGTEKVNLRGIFMHLFGISSCMNGNSVTC
Subjt:  SYILLSEKMIRRITILSNGTEKVNLRGIFMHLFGISSCMNGNSVTC

XP_031738119.1 uncharacterized protein LOC101203219 isoform X2 [Cucumis sativus] | e value: 0.0 | % identity: 99.35
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
        DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL

Query:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP
        EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP
Subjt:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP

Query:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
        LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
Subjt:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD

Query:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF
        QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNP   AIF
Subjt:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF

XP_038894829.1 uncharacterized protein LOC120083233 isoform X2 [Benincasa hispida] | e value: 3.77e-305 | % identity: 82.57
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDD SEGNN GSV SG +   SMDKIL E DP+ DA  ++CSKLESETGK LPEICN KGNVH KEHNDD+KLSKD 
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
        DTE++NING  NLILN  EVK+NVA YSVE+EEPSSMNAYKEDSGISEDPGG+  H+  D GNID+V QELSKEMIDV+KD HSREK SDP Y LPCNE 
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL

Query:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP
        E +GDGSLKS +VEQIND FGNNASEKIVEG VEE SVCCS  EHDDE ST KELIMSTP CVPP LENAET KEEV CF+ SGETSS V+A+AEE+TP 
Subjt:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP

Query:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
        LVLDTSEKGDSIG + KKLLVLDVNGLLADFI YVPPGYKPDI+I QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM D+REKLLFCWD
Subjt:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD

Query:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFIDG
        QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVY WQK FK+ LSRI L N PLQKR  LGSFI  
Subjt:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFIDG

Query:  SYILLSEKMIRRITILSNGTEK
        SYILLS K IRRI ILSNGTEK
Subjt:  SYILLSEKMIRRITILSNGTEK

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LSV7 FCP1 homology domain-containing protein | e value: 0.0 | % identity: 99.35
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
        DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL

Query:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP
        EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP
Subjt:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP

Query:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
        LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
Subjt:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD

Query:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF
        QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNP   AIF
Subjt:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF

A0A1S3BRN0 uncharacterized protein LOC103492827 isoform X2 | e value: 3.81e-303 | % identity: 91.13
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDSG +SEDPGGLR HEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEAST KELIMSTPSCVPP LENAETAKEEVVCFT SGETSS VNAMAEE+ P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP

Query:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNP   AIF
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF

A0A1S3BSS1 uncharacterized protein LOC103492827 isoform X1 | e value: 0.0 | % identity: 91.22
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDSG +SEDPGGLR HEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEAST KELIMSTPSCVPP LENAETAKEEVVCFT SGETSS VNAMAEE+ P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP

Query:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFID
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVY WQK FKN LSRIGLANVPLQKRTRLG+FID
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFID

Query:  GSYILLSEKMIRRITILSNGTEKVNLRGIFMHLFGISSCMNGNSVTC
        GSYILLSEK IRRI I+SNGTEKVNLRGIFMHLFGISSCMN  S TC
Subjt:  GSYILLSEKMIRRITILSNGTEKVNLRGIFMHLFGISSCMNGNSVTC

A0A5D3BIK1 Putative C-terminal domain small phosphatase isoform X2 | e value: 3.81e-303 | % identity: 91.13
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MDIS CDT EGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASS+DKILFE DPSPDATL+MCSKLESETGK LPEICN +GNVHE EHNDD+KLS+DT
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE
        DTE+ENI+GS NLILNAA VKQNVAG SVEM+EP+SMNAYKEDSG +SEDPGGLR HEVSDQGNID+VAQELSKEMIDV+KDVHSREKLSDP YPLPCNE
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSG-ISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNE

Query:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP
         EY+GDGSLKSLDVEQI+DTFGNNASEKIVEGPVEEASV CS GEHDDEAST KELIMSTPSCVPP LENAETAKEEVVCFT SGETSS VNAMAEE+ P
Subjt:  LEYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETP

Query:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW
         LVLDTSEKGDSIGST KKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM DYREKLLFCW
Subjt:  PLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCW

Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF
        DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNP   AIF
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF

A0A6J1E8F1 uncharacterized protein LOC111431707 isoform X1 | e value: 1.01e-242 | % identity: 69.75
Query:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT
        MD+S CDT EG EHKMKKRKQEQFDDA  GNN  SV SG E ASSM+KIL E D S  A  ++CSKLESET K    +CN    VHEK+ ++D K+S+D 
Subjt:  MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDT

Query:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL
         TE+ NI GS NLI            YSVEMEEPSSM+ YK +SGISED GG+R H+  DQ N+  V +ELSKE ID +KD  SRE+ SD     PCNE 
Subjt:  DTENENINGSYNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNEL

Query:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP
        EY+ D SLKS DVEQIN             G VEEASV  S GEHDDE ST KELI+STP C+PP L+NAET KEEVVCF+ SGETSS V+A+ EE+ P 
Subjt:  EYDGDGSLKSLDVEQINDTFGNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPP

Query:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD
        LVLDTSEKGDSIG + KKLLVLDVNGLLADFICYVP GYKPD++I QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDM IDFLMRD+R+KLLFCWD
Subjt:  LVLDTSEKGDSIGSTTKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWD

Query:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFIDG
        QSHCTDTTFSTVENKHKPLVLK+IKKLWKYLKPREFNASNTLLLDDSPHKALCNPD  AIFGF WKVY WQK +KNT SRI   N PLQKRTRLGSFI G
Subjt:  QSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFIDG

Query:  SYILLSEKMIRRITILSNGTEKVNLRGIF
        SYI LS + IRR+   S GTEK  L   F
Subjt:  SYILLSEKMIRRITILSNGTEKVNLRGIF

SwissProt top hits (columns: e value, % identity, alignment)
O94336 Uncharacterized FCP1 homology domain-containing protein C1271.03c | e value: 1.6e-07 | % identity: 25.73
Query:  PLVLDTSEKGDSIGST-TKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFK-------RPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDY
        P +   S+     G+T  +KL++LD+NG L   +C      +   +  +K+V++       RP   +F+K+ F  F V V+SS    NV  ++  +M + 
Subjt:  PLVLDTSEKGDSIGST-TKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFK-------RPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDY

Query:  REK-LLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNP
        ++K L+ CW +    D   +  +   K    K +  +W+ +      KP  ++  NT+++DDS  K   +P
Subjt:  REK-LLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNP

Q9XYL0 Probable C-terminal domain small phosphatase | e value: 7.7e-07 | % identity: 29.25
Query:  KLLVLDVNGLLADFICYVPPGYKPDII--------IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTF
        K LVLD++  L        P + PD I        I Q  V KRPF DDF++   E+FE+ V+++   +  D V+DFL  D    + +   +  C     
Subjt:  KLLVLDVNGLLADFICYVPPGYKPDII--------IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTF

Query:  STVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPD
            + HK   +K++ +L + LK       +T+++D+SP   L +P+
Subjt:  STVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPD

Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G36540.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | e value: 2.5e-24 | % identity: 34.18
Query:  ETAKEEVVCFTDSGETSSVVNAMAEEETPPLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFI-----CYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFE
        E  K+ ++   DS +  S  + ++++     +LD         +  KKLLVL ++GLL   +        P    PD       V+KRPF ++F+KFC E
Subjt:  ETAKEEVVCFTDSGETSSVVNAMAEEETPPLVLDTSEKGDSIGSTTKKLLVLDVNGLLADFI-----CYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFE

Query:  RFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF
        RFEVG+WSS        ++  L       +L       CTD+ + T+EN++KPL  K++ K++K  K   F+ASNT+ +DD P+KAL NPD   +F
Subjt:  RFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF

AT2G36550.1 CONTAINS InterPro DOMAIN/s: NLI interacting factor (InterPro:IPR004274) | e value: 2.0e-10 | % identity: 43.55
Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF
        DQ  CTD+ + T+EN  KPL  K++ K+++  K   F+ASNT+ +++ P+KAL NPD   +F
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPDREAIF

AT3G29760.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | e value: 2.6e-42 | % identity: 40.49
Query:  GNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKE-EVVCFTDSGETSSVVNAMAEEETPPLVLDTSEKGDSIGSTTKKL
        G    EK  EG V  A         +DE   +K    +  SCV  G E  E  +E  V+    S +  SVV     E     V+     G +     KKL
Subjt:  GNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKE-EVVCFTDSGETSSVVNAMAEEETPPLVLDTSEKGDSIGSTTKKL

Query:  LVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPL
        LVLD+NGLLAD +  +      DI I ++A+FKRPFCD+F++FCF++FEVG+WSSR + NV  + +FL+ D + KLLFCWD S+C  T+  ++EN++K +
Subjt:  LVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPL

Query:  VLKEIKKLWKYLKPR------EFNASNTLLLDDSPHKALCNPDREAI
        V K++ +LW+   PR      ++N +NT+LLDDSP+KAL NP    I
Subjt:  VLKEIKKLWKYLKPR------EFNASNTLLLDDSPHKALCNPDREAI

AT4G26190.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | e value: 2.8e-36 | % identity: 40.43
Query:  ETSSVVNAMAEEETPPLVLDTSEKGDSIGST-----TKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRN
        ETS   + + + +     + +SE GD    T     T+KL++ D+NG+LAD +      + PD  +  ++VF+RPF   F+ FCFERF+V +WSSR R  
Subjt:  ETSSVVNAMAEEETPPLVLDTSEKGDSIGST-----TKKLLVLDVNGLLADFICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRN

Query:  VDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNPDREAIF
        +D +I+ +M+++   LLFC+DQ+ CT T F T E K KPL LK+++++W ++        R+++ +NTLL+DDSP KALCNP    IF
Subjt:  VDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNPDREAIF


Sequences
CDS sequence
ATGGATATTTCAGCATGTGATACAAACGAAGGGTTGGAGCATAAAATGAAAAAGAGAAAGCAAGAACAGTTTGATGATGCTTCTGAAGGAAACAATACGGGTAGTGTTTG
TTCTGGTTTTGAATTTGCATCTTCAATGGATAAAATTCTGTTTGAAAAAGATCCTTCTCCAGATGCCACGTTAGTTATGTGCTCGAAGTTGGAGTCTGAAACAGGAAAAA
ATCTTCCAGAGATATGTAATCTAAAAGGGAATGTTCATGAAAAGGAGCATAATGATGATGAAAAATTGTCAAAGGATACGGATACAGAGAATGAAAATATTAATGGTTCT
TATAATCTTATCCTGAATGCAGCAGAAGTAAAACAAAATGTTGCTGGATATAGTGTTGAAATGGAAGAGCCAAGTTCTATGAATGCTTACAAAGAAGACTCTGGGATCTC
TGAAGATCCAGGTGGTTTGAGAGGTCATGAGGTATCTGATCAAGGAAACATTGACACTGTCGCCCAAGAACTGAGCAAGGAGATGATAGATGTGAAGAAGGATGTTCATT
CTAGAGAAAAACTCTCTGACCCTGATTATCCTTTGCCATGTAATGAGCTGGAATACGATGGGGATGGTTCTTTGAAAAGTTTAGATGTAGAACAAATAAATGACACATTT
GGTAATAATGCTTCAGAAAAGATCGTGGAAGGCCCTGTGGAGGAAGCTTCTGTTTGCTGTTCATTTGGTGAGCATGATGATGAAGCTTCAACGATCAAGGAACTGATTAT
GTCAACTCCTTCCTGTGTGCCTCCAGGACTGGAAAATGCTGAAACTGCCAAGGAAGAAGTTGTATGTTTCACTGATTCTGGTGAGACAAGCAGTGTTGTTAATGCTATGG
CTGAAGAGGAAACTCCACCGCTGGTTTTGGATACTTCAGAGAAAGGAGATTCAATTGGATCTACAACGAAGAAGCTTCTTGTTCTTGATGTAAATGGACTGCTTGCAGAT
TTTATTTGTTACGTTCCACCTGGATATAAGCCAGACATTATAATAAGACAAAAAGCAGTGTTCAAGAGGCCATTTTGTGATGATTTTATTAAGTTTTGTTTTGAAAGATT
CGAGGTGGGTGTTTGGTCGTCAAGAACTCGGAGAAATGTGGACATGGTGATAGATTTTCTAATGAGAGACTATAGAGAAAAATTACTATTTTGCTGGGATCAATCACACT
GTACTGACACCACGTTCTCTACCGTAGAGAACAAGCACAAGCCTTTAGTCTTAAAGGAAATAAAAAAACTGTGGAAATACCTCAAGCCACGAGAGTTCAATGCATCAAAC
ACCCTACTGTTGGATGATTCCCCACACAAGGCATTGTGCAATCCGGACCGGGAGGCGATCTTCGGGTTTTTCTGGAAGGTTTATCCATGGCAGAAAACGTTCAAAAATAC
GTTGAGCAGAATCGGTTTGGCCAACGTCCCATTACAGAAAAGAACGCGTCTTGGAAGTTTTATAGACGGATCATATATTTTGTTGAGCGAAAAAATGATCAGGAGAATAA
CAATTCTTTCCAATGGAACTGAAAAGGTAAACTTGAGAGGCATTTTCATGCATCTTTTTGGTATATCTTCTTGTATGAATGGAAACAGTGTTACATGTTAG
mRNA sequence
ATGGATATTTCAGCATGTGATACAAACGAAGGGTTGGAGCATAAAATGAAAAAGAGAAAGCAAGAACAGTTTGATGATGCTTCTGAAGGAAACAATACGGGTAGTGTTTG
TTCTGGTTTTGAATTTGCATCTTCAATGGATAAAATTCTGTTTGAAAAAGATCCTTCTCCAGATGCCACGTTAGTTATGTGCTCGAAGTTGGAGTCTGAAACAGGAAAAA
ATCTTCCAGAGATATGTAATCTAAAAGGGAATGTTCATGAAAAGGAGCATAATGATGATGAAAAATTGTCAAAGGATACGGATACAGAGAATGAAAATATTAATGGTTCT
TATAATCTTATCCTGAATGCAGCAGAAGTAAAACAAAATGTTGCTGGATATAGTGTTGAAATGGAAGAGCCAAGTTCTATGAATGCTTACAAAGAAGACTCTGGGATCTC
TGAAGATCCAGGTGGTTTGAGAGGTCATGAGGTATCTGATCAAGGAAACATTGACACTGTCGCCCAAGAACTGAGCAAGGAGATGATAGATGTGAAGAAGGATGTTCATT
CTAGAGAAAAACTCTCTGACCCTGATTATCCTTTGCCATGTAATGAGCTGGAATACGATGGGGATGGTTCTTTGAAAAGTTTAGATGTAGAACAAATAAATGACACATTT
GGTAATAATGCTTCAGAAAAGATCGTGGAAGGCCCTGTGGAGGAAGCTTCTGTTTGCTGTTCATTTGGTGAGCATGATGATGAAGCTTCAACGATCAAGGAACTGATTAT
GTCAACTCCTTCCTGTGTGCCTCCAGGACTGGAAAATGCTGAAACTGCCAAGGAAGAAGTTGTATGTTTCACTGATTCTGGTGAGACAAGCAGTGTTGTTAATGCTATGG
CTGAAGAGGAAACTCCACCGCTGGTTTTGGATACTTCAGAGAAAGGAGATTCAATTGGATCTACAACGAAGAAGCTTCTTGTTCTTGATGTAAATGGACTGCTTGCAGAT
TTTATTTGTTACGTTCCACCTGGATATAAGCCAGACATTATAATAAGACAAAAAGCAGTGTTCAAGAGGCCATTTTGTGATGATTTTATTAAGTTTTGTTTTGAAAGATT
CGAGGTGGGTGTTTGGTCGTCAAGAACTCGGAGAAATGTGGACATGGTGATAGATTTTCTAATGAGAGACTATAGAGAAAAATTACTATTTTGCTGGGATCAATCACACT
GTACTGACACCACGTTCTCTACCGTAGAGAACAAGCACAAGCCTTTAGTCTTAAAGGAAATAAAAAAACTGTGGAAATACCTCAAGCCACGAGAGTTCAATGCATCAAAC
ACCCTACTGTTGGATGATTCCCCACACAAGGCATTGTGCAATCCGGACCGGGAGGCGATCTTCGGGTTTTTCTGGAAGGTTTATCCATGGCAGAAAACGTTCAAAAATAC
GTTGAGCAGAATCGGTTTGGCCAACGTCCCATTACAGAAAAGAACGCGTCTTGGAAGTTTTATAGACGGATCATATATTTTGTTGAGCGAAAAAATGATCAGGAGAATAA
CAATTCTTTCCAATGGAACTGAAAAGGTAAACTTGAGAGGCATTTTCATGCATCTTTTTGGTATATCTTCTTGTATGAATGGAAACAGTGTTACATGTTAG
Protein sequence
MDISACDTNEGLEHKMKKRKQEQFDDASEGNNTGSVCSGFEFASSMDKILFEKDPSPDATLVMCSKLESETGKNLPEICNLKGNVHEKEHNDDEKLSKDTDTENENINGS
YNLILNAAEVKQNVAGYSVEMEEPSSMNAYKEDSGISEDPGGLRGHEVSDQGNIDTVAQELSKEMIDVKKDVHSREKLSDPDYPLPCNELEYDGDGSLKSLDVEQINDTF
GNNASEKIVEGPVEEASVCCSFGEHDDEASTIKELIMSTPSCVPPGLENAETAKEEVVCFTDSGETSSVVNAMAEEETPPLVLDTSEKGDSIGSTTKKLLVLDVNGLLAD
FICYVPPGYKPDIIIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMRDYREKLLFCWDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASN
TLLLDDSPHKALCNPDREAIFGFFWKVYPWQKTFKNTLSRIGLANVPLQKRTRLGSFIDGSYILLSEKMIRRITILSNGTEKVNLRGIFMHLFGISSCMNGNSVTC
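
One way to check that the sequences above agree with each other is to translate the CDS and compare the result with the protein sequence. The sketch below is not part of the CuGenDB record: the helper name `translate` is hypothetical, and it assumes the standard genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each position. Assumed here, not stated in the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Hypothetical helper for illustration. Whitespace and line breaks are
    removed first, so wrapped sequence lines can be passed in directly.
    Codons containing non-ACGT characters are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence listed above.
print(translate("ATGGATATTTCAGCATGTGATACAAACGAAGGG"))  # MDISACDTNEG
```

Passing the full wrapped CDS text to `translate` and comparing the result with the joined protein lines would show whether the two listings are consistent with each other.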