; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10020621 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10020621
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionFCP1 homology domain-containing protein
Genome locationChr05:1003089..1006701
RNA-Seq ExpressionHG10020621
SyntenyHG10020621
Gene Ontology termsNA
InterPro domainsIPR004274 - FCP1 homology domain
IPR023214 - HAD superfamily
IPR036412 - HAD-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008451760.1 PREDICTED: uncharacterized protein LOC103492827 isoform X2 [Cucumis melo]3.1e-23079.66Show/hide
Query:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP
        MKKRKQEQFD+A EGNN GSV S  E ASS+DKIL ENDP+P AT I+C KLESETGKTLP+ICNS+GNVHE EHNDDQKLS+D DTEHENI+GSDNLI 
Subjt:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP

Query:  DAADVKQNVARYSVEMEEPSSTNAYKEDS-GISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG
        +AA VKQNVA  SVEM+EP+S NAYKEDS G+SEDPGG+RDHE    GNID+VAQELSKEMIDVRKD HS E  SDP Y LPCNEQEYEGDGSLKS DV 
Subjt:  DAADVKQNVARYSVEMEEPSSTNAYKEDS-GISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG

Query:  QINDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGF
        QI+D FGNNAS+KIVEG VEE SV CS+ EHD+E STSKELIMSTPSC+PP LENAETAKEEVVCF+ASGETSSG++A+A+EK PSLVL+TSEKGDSIG 
Subjt:  QINDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGF

Query:  STKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVEN
        + KKLLVLDVNGLLADFI YVPPGYKPDI+I QKAVFKRPFCDDFIKFCFERFE                               DQSHCTDTTFSTVEN
Subjt:  STKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVEN

Query:  KHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPS
        KHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQ YVEQNRFGQRPITEKN S
Subjt:  KHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPS

Query:  WKFYRRVIYFVERKNDQEDTNSFRWN
        WKFYRR+IYFVERKNDQE++N F+WN
Subjt:  WKFYRRVIYFVERKNDQEDTNSFRWN

XP_022137426.1 uncharacterized protein LOC111008876 [Momordica charantia]1.2e-21073.53Show/hide
Query:  KKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIPD
        KKRKQEQFD A E NN+GSV S SE+ASSMD ILSENDPAP AT IIC KLESETG++ P ICN +GNVH KEHNDD + SKDMDTEHENINGS +LI +
Subjt:  KKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIPD

Query:  AADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKD----DHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG
          DVKQNVARYS+EMEEPSS NAYKEDS +SEDP   RDH+ HGN+  V QEL+KEMID  KD     HS E  SD +YH PCNEQEYE D SLK+SD+ 
Subjt:  AADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKD----DHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG

Query:  QINDAFGNNASKKIVEGAVEEISVCCSI-SEHDNETSTSKELIMSTPSCMPPELENAETAK--EEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDS
        QIN A GNN S+K V+ A+EE SVCC++  E+D+ETSTSKE I+STPSCMPPE ENA+T K  EEVVCFSASGETS  +DA+ +E  P LVL+TSEKGDS
Subjt:  QINDAFGNNASKKIVEGAVEEISVCCSI-SEHDNETSTSKELIMSTPSCMPPELENAETAK--EEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDS

Query:  IGFSTKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFST
        IG S KKLLVLDVNGLLADFI YVP GYKPDI+IGQKAVFKRPFCDDFIKFCFERFE                               DQSHCTDTTFST
Subjt:  IGFSTKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFST

Query:  VENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEK
        VEN HKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRD  DTSLGPGGDLRV+LEGLS+AENVQ YVEQN FGQRPITEK
Subjt:  VENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEK

Query:  NPSWKFYRRVIYFVERKNDQEDTNSFRWN
        N SWKFYRR+IYFVER+ND++DTNSF+WN
Subjt:  NPSWKFYRRVIYFVERKNDQEDTNSFRWN

XP_031738119.1 uncharacterized protein LOC101203219 isoform X2 [Cucumis sativus]5.3e-23079.43Show/hide
Query:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP
        MKKRKQEQFD+A EGNN GSV S  E ASSMDKIL E DP+P AT ++C KLESETGK LP+ICN KGNVHEKEHNDD+KLSKD DTE+ENINGS NLI 
Subjt:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP

Query:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQ
        +AA+VKQNVA YSVEMEEPSS NAYKEDSGISEDPGG+R HE    GNID VAQELSKEMIDV+KD HS E  SDP+Y LPCNE EY+GDGSLKS DV Q
Subjt:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQ

Query:  INDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFS
        IND FGNNAS+KIVEG VEE SVCCS  EHD+E ST KELIMSTPSC+PP LENAETAKEEVVCF+ SGETSS ++A+A+E+TP LVL+TSEKGDSIG +
Subjt:  INDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFS

Query:  TKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENK
        TKKLLVLDVNGLLADFI YVPPGYKPDI+I QKAVFKRPFCDDFIKFCFERFE                               DQSHCTDTTFSTVENK
Subjt:  TKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENK

Query:  HKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSW
        HKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQ YVEQNRFGQRPITEKN SW
Subjt:  HKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSW

Query:  KFYRRVIYFVERKNDQEDTNSFRWN
        KFYRR+IYFVERKNDQE+ NSF+WN
Subjt:  KFYRRVIYFVERKNDQEDTNSFRWN

XP_038894827.1 uncharacterized protein LOC120083233 isoform X1 [Benincasa hispida]2.4e-24683.75Show/hide
Query:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP
        MKKRKQEQFD+  EGNNMGSV S S+D  SMDKILSENDPA  A FIIC KLESETGKTLP+ICNSKGNVH KEHNDDQKLSKDMDTEH+NING DNLI 
Subjt:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP

Query:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQIN
        +  +VK+NVARYSVE+EEPSS NAYKEDSGISEDPGGM DH+ HGNID+V QELSKEMIDVRKDDHS E FSDP YHLPCNE+E EGDGSLKSS+V QIN
Subjt:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQIN

Query:  DAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTK
        D FGNNAS+KIVEGAVEE SVCCS++EHD+ETSTSKELIMSTP C+PPELENAET KEE VCFSASGETSSG+DA+A+EKTPSLVL+TSEKGDSIGFS K
Subjt:  DAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTK

Query:  KLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENKHK
        KLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE                               DQSHCTDTTFSTVENKHK
Subjt:  KLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENKHK

Query:  PLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSWKF
        PLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQ YVEQNRFGQRPITEKNPSWKF
Subjt:  PLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSWKF

Query:  YRRVIYFVERKNDQEDTNSFRWN
        YRR+IYFVERKNDQEDTNSF+WN
Subjt:  YRRVIYFVERKNDQEDTNSFRWN

XP_038894829.1 uncharacterized protein LOC120083233 isoform X2 [Benincasa hispida]2.5e-19580.86Show/hide
Query:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP
        MKKRKQEQFD+  EGNNMGSV S S+D  SMDKILSENDPA  A FIIC KLESETGKTLP+ICNSKGNVH KEHNDDQKLSKDMDTEH+NING DNLI 
Subjt:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP

Query:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQIN
        +  +VK+NVARYSVE+EEPSS NAYKEDSGISEDPGGM DH+ HGNID+V QELSKEMIDVRKDDHS E FSDP YHLPCNE+E EGDGSLKSS+V QIN
Subjt:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQIN

Query:  DAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTK
        D FGNNAS+KIVEGAVEE SVCCS++EHD+ETSTSKELIMSTP C+PPELENAET KEE VCFSASGETSSG+DA+A+EKTPSLVL+TSEKGDSIGFS K
Subjt:  DAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTK

Query:  KLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENKHK
        KLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE                               DQSHCTDTTFSTVENKHK
Subjt:  KLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENKHK

Query:  PLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIF
        PLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNP   AIF
Subjt:  PLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIF

TrEMBL top hitse value%identityAlignment
A0A0A0LSV7 FCP1 homology domain-containing protein2.6e-23079.43Show/hide
Query:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP
        MKKRKQEQFD+A EGNN GSV S  E ASSMDKIL E DP+P AT ++C KLESETGK LP+ICN KGNVHEKEHNDD+KLSKD DTE+ENINGS NLI 
Subjt:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP

Query:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQ
        +AA+VKQNVA YSVEMEEPSS NAYKEDSGISEDPGG+R HE    GNID VAQELSKEMIDV+KD HS E  SDP+Y LPCNE EY+GDGSLKS DV Q
Subjt:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQ

Query:  INDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFS
        IND FGNNAS+KIVEG VEE SVCCS  EHD+E ST KELIMSTPSC+PP LENAETAKEEVVCF+ SGETSS ++A+A+E+TP LVL+TSEKGDSIG +
Subjt:  INDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFS

Query:  TKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENK
        TKKLLVLDVNGLLADFI YVPPGYKPDI+I QKAVFKRPFCDDFIKFCFERFE                               DQSHCTDTTFSTVENK
Subjt:  TKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENK

Query:  HKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSW
        HKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQ YVEQNRFGQRPITEKN SW
Subjt:  HKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSW

Query:  KFYRRVIYFVERKNDQEDTNSFRWN
        KFYRR+IYFVERKNDQE+ NSF+WN
Subjt:  KFYRRVIYFVERKNDQEDTNSFRWN

A0A1S3BRN0 uncharacterized protein LOC103492827 isoform X21.5e-23079.66Show/hide
Query:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP
        MKKRKQEQFD+A EGNN GSV S  E ASS+DKIL ENDP+P AT I+C KLESETGKTLP+ICNS+GNVHE EHNDDQKLS+D DTEHENI+GSDNLI 
Subjt:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP

Query:  DAADVKQNVARYSVEMEEPSSTNAYKEDS-GISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG
        +AA VKQNVA  SVEM+EP+S NAYKEDS G+SEDPGG+RDHE    GNID+VAQELSKEMIDVRKD HS E  SDP Y LPCNEQEYEGDGSLKS DV 
Subjt:  DAADVKQNVARYSVEMEEPSSTNAYKEDS-GISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG

Query:  QINDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGF
        QI+D FGNNAS+KIVEG VEE SV CS+ EHD+E STSKELIMSTPSC+PP LENAETAKEEVVCF+ASGETSSG++A+A+EK PSLVL+TSEKGDSIG 
Subjt:  QINDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGF

Query:  STKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVEN
        + KKLLVLDVNGLLADFI YVPPGYKPDI+I QKAVFKRPFCDDFIKFCFERFE                               DQSHCTDTTFSTVEN
Subjt:  STKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVEN

Query:  KHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPS
        KHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQ YVEQNRFGQRPITEKN S
Subjt:  KHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPS

Query:  WKFYRRVIYFVERKNDQEDTNSFRWN
        WKFYRR+IYFVERKNDQE++N F+WN
Subjt:  WKFYRRVIYFVERKNDQEDTNSFRWN

A0A5D3BIK1 Putative C-terminal domain small phosphatase isoform X21.5e-23079.66Show/hide
Query:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP
        MKKRKQEQFD+A EGNN GSV S  E ASS+DKIL ENDP+P AT I+C KLESETGKTLP+ICNS+GNVHE EHNDDQKLS+D DTEHENI+GSDNLI 
Subjt:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP

Query:  DAADVKQNVARYSVEMEEPSSTNAYKEDS-GISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG
        +AA VKQNVA  SVEM+EP+S NAYKEDS G+SEDPGG+RDHE    GNID+VAQELSKEMIDVRKD HS E  SDP Y LPCNEQEYEGDGSLKS DV 
Subjt:  DAADVKQNVARYSVEMEEPSSTNAYKEDS-GISEDPGGMRDHE--RHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG

Query:  QINDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGF
        QI+D FGNNAS+KIVEG VEE SV CS+ EHD+E STSKELIMSTPSC+PP LENAETAKEEVVCF+ASGETSSG++A+A+EK PSLVL+TSEKGDSIG 
Subjt:  QINDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGF

Query:  STKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVEN
        + KKLLVLDVNGLLADFI YVPPGYKPDI+I QKAVFKRPFCDDFIKFCFERFE                               DQSHCTDTTFSTVEN
Subjt:  STKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVEN

Query:  KHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPS
        KHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQ YVEQNRFGQRPITEKN S
Subjt:  KHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPS

Query:  WKFYRRVIYFVERKNDQEDTNSFRWN
        WKFYRR+IYFVERKNDQE++N F+WN
Subjt:  WKFYRRVIYFVERKNDQEDTNSFRWN

A0A6J1C778 uncharacterized protein LOC1110088766.0e-21173.53Show/hide
Query:  KKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIPD
        KKRKQEQFD A E NN+GSV S SE+ASSMD ILSENDPAP AT IIC KLESETG++ P ICN +GNVH KEHNDD + SKDMDTEHENINGS +LI +
Subjt:  KKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIPD

Query:  AADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKD----DHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG
          DVKQNVARYS+EMEEPSS NAYKEDS +SEDP   RDH+ HGN+  V QEL+KEMID  KD     HS E  SD +YH PCNEQEYE D SLK+SD+ 
Subjt:  AADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKD----DHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVG

Query:  QINDAFGNNASKKIVEGAVEEISVCCSI-SEHDNETSTSKELIMSTPSCMPPELENAETAK--EEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDS
        QIN A GNN S+K V+ A+EE SVCC++  E+D+ETSTSKE I+STPSCMPPE ENA+T K  EEVVCFSASGETS  +DA+ +E  P LVL+TSEKGDS
Subjt:  QINDAFGNNASKKIVEGAVEEISVCCSI-SEHDNETSTSKELIMSTPSCMPPELENAETAK--EEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDS

Query:  IGFSTKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFST
        IG S KKLLVLDVNGLLADFI YVP GYKPDI+IGQKAVFKRPFCDDFIKFCFERFE                               DQSHCTDTTFST
Subjt:  IGFSTKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFST

Query:  VENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEK
        VEN HKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRD  DTSLGPGGDLRV+LEGLS+AENVQ YVEQN FGQRPITEK
Subjt:  VENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEK

Query:  NPSWKFYRRVIYFVERKNDQEDTNSFRWN
        N SWKFYRR+IYFVER+ND++DTNSF+WN
Subjt:  NPSWKFYRRVIYFVERKNDQEDTNSFRWN

A0A6J1KFT0 uncharacterized protein LOC111495396 isoform X22.5e-19371.51Show/hide
Query:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP
        MKKRKQEQFD+A  GNNM SV S SEDASSMDKILSEND A +A FIIC KLESET K    +C    NVHEK+ +DD+K+S+DM TEH NINGS NLI 
Subjt:  MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIP

Query:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQIN
                   YSVEMEEPSS + YK +SGISED GGMR+H+ H N+ NV +ELS+E ID RKD HS E FSD   + PC EQE E D SLKSS V QIN
Subjt:  DAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQIN

Query:  DAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTK
                     G VEE SV  S+ EHD+ETSTSKELI STP C+PPEL+NAET KEEVVCFS SGETSSGIDA+ +EKTP+LVL+TSEKGDSIG S K
Subjt:  DAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTK

Query:  KLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENKHK
        KLLVLDVNGLLADFI YVP GYKPD+VIGQKAVFKRPFCDDFIKFCFERFE                               DQS CTDTTFSTVENKHK
Subjt:  KLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENKHK

Query:  PLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSWKF
        PLVLK+IKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRV+LEGLSMAENVQ YVE+N FGQRPITEKNPSWKF
Subjt:  PLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSWKF

Query:  YRRVIYFVERKNDQEDTNSFRWN
        YRR+IYFVER+NDQEDTNSF WN
Subjt:  YRRVIYFVERKNDQEDTNSFRWN

SwissProt top hitse value%identityAlignment
Q9XYL0 Probable C-terminal domain small phosphatase1.9e-0426.92Show/hide
Query:  KLLVLDVNGLLADFIIYVPPGYKPDIV--------IGQKAVFKRPFCDDFIKFCFERFE------DQSHCTDTTFSTVE--------------NKHKPLV
        K LVLD++  L        P + PD +        I Q  V KRPF DDF++   E+FE        +   D     ++              + HK   
Subjt:  KLLVLDVNGLLADFIIYVPPGYKPDIV--------IGQKAVFKRPFCDDFIKFCFERFE------DQSHCTDTTFSTVE--------------NKHKPLV

Query:  LKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNR
        +K++ +L + LK       +T+++D+SP   L +P N    P+   F D DD  L    DL   L+ L   E+V+  ++++R
Subjt:  LKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNR

Arabidopsis top hitse value%identityAlignment
AT2G36540.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein1.2e-3334.69Show/hide
Query:  ETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTKKLLVLDVNGLLADFI-----IYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFE
        E  K+ ++    S +  S  D V+ +   S +L+            KKLLVL ++GLL   +        P    PD   G   V+KRPF ++F+KFC E
Subjt:  ETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTKKLLVLDVNGLLADFI-----IYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFE

Query:  RFE--------------------DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLG
        RFE                        CTD+ + T+EN++KPL  K++ K++K  K   F+ASNT+ +DD P+KAL NP NT +FP++Y   +  D  L 
Subjt:  RFE--------------------DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLG

Query:  PGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSWKFYRRV
        P G+L  +LEGL+ + +VQ Y++ + FG+  I   +P W FY  V
Subjt:  PGGDLRVFLEGLSMAENVQNYVEQNRFGQRPITEKNPSWKFYRRV

AT2G36550.1 CONTAINS InterPro DOMAIN/s: NLI interacting factor (InterPro:IPR004274)9.9e-2541.8Show/hide
Query:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVE
        DQ  CTD+ + T+EN  KPL  K++ K+++  K   F+ASNT+ +++ P+KAL NP NT +FP++Y   DT D  L P G+   +L+GL+ + +VQ Y++
Subjt:  DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVE

Query:  QNRFGQRPITEKNPSWKFYRRV
        ++ FGQ  I   +  W +YRRV
Subjt:  QNRFGQRPITEKNPSWKFYRRV

AT3G29760.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein1.2e-2231.78Show/hide
Query:  EEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTKKLLVLDVNGLLADFII
        +E SV  +++ +D        ++ +  SC+    E  E  +E     S      S  +    E   S V+     G +     KKLLVLD+NGLLAD I+
Subjt:  EEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTKKLLVLDVNGLLADFII

Query:  YVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKP
                DI IG++A+FKRPFCD+F++FCF++FE                               D S+C  T+  ++EN++K +V K++ +LW+   P
Subjt:  YVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE-------------------------------DQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKP

Query:  R------EFNASNTLLLDDSPHKALCNPANTAIFPV
        R      ++N +NT+LLDDSP+KAL NP  + I  +
Subjt:  R------EFNASNTLLLDDSPHKALCNPANTAIFPV

AT4G26190.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein6.6e-3727.13Show/hide
Query:  KRKQEQFDNALEGNNMG-SVLSSSEDASSMD--KILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLI
        KRK+   D  +E N         S+D    +  KI ++ +    AT      ++SE+ ++L    N +G  + + + D +  SKD+ +  E+       +
Subjt:  KRKQEQFDNALEGNNMG-SVLSSSEDASSMD--KILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLI

Query:  PDAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQI
         D  + K+       E+++ +     K   G+S              +     +  K ++D + D+   ++    E                K  +V Q 
Subjt:  PDAADVKQNVARYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQI

Query:  NDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGD-----S
        ND      SK  VE   ++     +  +H  +    K+ +      +P + E  E  +E++       ETS     + Q    +  + +SE GD      
Subjt:  NDAFGNNASKKIVEGAVEEISVCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGD-----S

Query:  IGFSTKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE------------------------------DQSHCTDTTFSTV
            T+KL++ D+NG+LAD +      + PD  +  ++VF+RPF   F+ FCFERF+                              DQ+ CT T F T 
Subjt:  IGFSTKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQKAVFKRPFCDDFIKFCFERFE------------------------------DQSHCTDTTFSTV

Query:  ENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQR
        E K KPL LK+++++W ++        R+++ +NTLL+DDSP KALCNP +T IFP  Y++ +  D++LGP G+LR +LE L+ AENVQ +V +N FGQ 
Subjt:  ENKHKPLVLKEIKKLWKYL------KPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGLSMAENVQNYVEQNRFGQR

Query:  PITEKNPSWKFYRRVI
         ITE + SW+FY + +
Subjt:  PITEKNPSWKFYRRVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAAGAGAAAGCAAGAACAGTTTGATAATGCTCTTGAAGGAAACAATATGGGTAGTGTTCTTTCTAGTTCTGAAGATGCATCTTCAATGGATAAAATCCTGTCTGA
AAATGATCCAGCTCCAGCTGCCACGTTCATTATATGCCCAAAGTTGGAATCTGAAACAGGAAAAACTCTTCCAGATATATGTAATTCAAAAGGAAATGTTCATGAAAAGG
AGCATAATGATGACCAAAAATTGTCAAAAGATATGGATACAGAGCATGAAAATATCAATGGTTCTGATAATCTTATCCCGGATGCAGCAGATGTAAAACAAAATGTTGCT
AGATATAGTGTTGAAATGGAAGAGCCAAGTTCAACGAACGCTTACAAAGAAGACTCTGGGATCTCTGAAGATCCAGGTGGTATGAGAGACCATGAACGTCATGGAAACAT
TGACAATGTTGCCCAAGAACTGAGCAAGGAGATGATAGATGTGAGGAAGGATGATCATTCTGGAGAAAATTTCTCTGACCCTGAGTATCATTTGCCATGTAATGAACAGG
AATACGAGGGGGATGGTTCATTGAAAAGTTCAGATGTAGGACAGATAAATGATGCATTTGGTAATAATGCTTCAAAGAAGATTGTGGAAGGCGCCGTGGAAGAAATTTCT
GTTTGCTGTTCAATCAGTGAGCATGATAATGAAACTTCAACAAGCAAGGAACTGATTATGTCAACTCCTTCCTGCATGCCTCCTGAACTGGAAAATGCTGAAACTGCGAA
GGAAGAAGTTGTATGTTTCTCAGCTTCTGGTGAGACAAGCAGTGGTATTGATGCTGTTGCTCAAGAGAAAACTCCGTCGCTGGTATTGAATACTTCAGAGAAAGGAGATT
CTATTGGTTTTTCAACGAAAAAGCTTCTTGTTCTCGATGTAAATGGACTGCTTGCAGATTTTATTATTTATGTTCCACCTGGATATAAGCCAGACATTGTAATAGGACAA
AAAGCAGTGTTCAAGAGGCCATTTTGTGATGATTTTATTAAGTTTTGTTTTGAAAGATTCGAGGATCAATCACACTGTACCGACACCACGTTCTCTACCGTAGAGAACAA
GCACAAGCCTTTAGTCTTAAAGGAAATAAAAAAACTGTGGAAATACCTTAAGCCACGAGAGTTCAATGCATCAAACACTCTGCTGCTGGATGATTCCCCACACAAAGCAT
TGTGCAATCCGGCAAACACTGCAATATTTCCTGTAACATATCGGTTTAGGGATACTGACGATACGTCATTAGGACCAGGAGGCGATCTTCGGGTTTTTCTGGAAGGTTTA
TCGATGGCAGAAAACGTTCAAAACTACGTTGAGCAGAATCGTTTTGGTCAACGTCCCATTACAGAAAAGAACCCGTCTTGGAAGTTTTATAGGCGGGTCATATATTTTGT
TGAGCGCAAAAACGATCAGGAGGATACGAATTCTTTCAGATGGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAAGAGAAAGCAAGAACAGTTTGATAATGCTCTTGAAGGAAACAATATGGGTAGTGTTCTTTCTAGTTCTGAAGATGCATCTTCAATGGATAAAATCCTGTCTGA
AAATGATCCAGCTCCAGCTGCCACGTTCATTATATGCCCAAAGTTGGAATCTGAAACAGGAAAAACTCTTCCAGATATATGTAATTCAAAAGGAAATGTTCATGAAAAGG
AGCATAATGATGACCAAAAATTGTCAAAAGATATGGATACAGAGCATGAAAATATCAATGGTTCTGATAATCTTATCCCGGATGCAGCAGATGTAAAACAAAATGTTGCT
AGATATAGTGTTGAAATGGAAGAGCCAAGTTCAACGAACGCTTACAAAGAAGACTCTGGGATCTCTGAAGATCCAGGTGGTATGAGAGACCATGAACGTCATGGAAACAT
TGACAATGTTGCCCAAGAACTGAGCAAGGAGATGATAGATGTGAGGAAGGATGATCATTCTGGAGAAAATTTCTCTGACCCTGAGTATCATTTGCCATGTAATGAACAGG
AATACGAGGGGGATGGTTCATTGAAAAGTTCAGATGTAGGACAGATAAATGATGCATTTGGTAATAATGCTTCAAAGAAGATTGTGGAAGGCGCCGTGGAAGAAATTTCT
GTTTGCTGTTCAATCAGTGAGCATGATAATGAAACTTCAACAAGCAAGGAACTGATTATGTCAACTCCTTCCTGCATGCCTCCTGAACTGGAAAATGCTGAAACTGCGAA
GGAAGAAGTTGTATGTTTCTCAGCTTCTGGTGAGACAAGCAGTGGTATTGATGCTGTTGCTCAAGAGAAAACTCCGTCGCTGGTATTGAATACTTCAGAGAAAGGAGATT
CTATTGGTTTTTCAACGAAAAAGCTTCTTGTTCTCGATGTAAATGGACTGCTTGCAGATTTTATTATTTATGTTCCACCTGGATATAAGCCAGACATTGTAATAGGACAA
AAAGCAGTGTTCAAGAGGCCATTTTGTGATGATTTTATTAAGTTTTGTTTTGAAAGATTCGAGGATCAATCACACTGTACCGACACCACGTTCTCTACCGTAGAGAACAA
GCACAAGCCTTTAGTCTTAAAGGAAATAAAAAAACTGTGGAAATACCTTAAGCCACGAGAGTTCAATGCATCAAACACTCTGCTGCTGGATGATTCCCCACACAAAGCAT
TGTGCAATCCGGCAAACACTGCAATATTTCCTGTAACATATCGGTTTAGGGATACTGACGATACGTCATTAGGACCAGGAGGCGATCTTCGGGTTTTTCTGGAAGGTTTA
TCGATGGCAGAAAACGTTCAAAACTACGTTGAGCAGAATCGTTTTGGTCAACGTCCCATTACAGAAAAGAACCCGTCTTGGAAGTTTTATAGGCGGGTCATATATTTTGT
TGAGCGCAAAAACGATCAGGAGGATACGAATTCTTTCAGATGGAACTGA
Protein sequenceShow/hide protein sequence
MKKRKQEQFDNALEGNNMGSVLSSSEDASSMDKILSENDPAPAATFIICPKLESETGKTLPDICNSKGNVHEKEHNDDQKLSKDMDTEHENINGSDNLIPDAADVKQNVA
RYSVEMEEPSSTNAYKEDSGISEDPGGMRDHERHGNIDNVAQELSKEMIDVRKDDHSGENFSDPEYHLPCNEQEYEGDGSLKSSDVGQINDAFGNNASKKIVEGAVEEIS
VCCSISEHDNETSTSKELIMSTPSCMPPELENAETAKEEVVCFSASGETSSGIDAVAQEKTPSLVLNTSEKGDSIGFSTKKLLVLDVNGLLADFIIYVPPGYKPDIVIGQ
KAVFKRPFCDDFIKFCFERFEDQSHCTDTTFSTVENKHKPLVLKEIKKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVFLEGL
SMAENVQNYVEQNRFGQRPITEKNPSWKFYRRVIYFVERKNDQEDTNSFRWN