CuGenDBv2

Tan0012532 (gene) of Snake gourd v1 genome

Gene ID: Tan0012532
Organism: Trichosanthes anguina (Snake gourd v1)
Description: FCP1 homology domain-containing protein
Genome location: LG05:74902794..74906280
RNA-Seq Expression: Tan0012532
Synteny: Tan0012532
Gene Ontology terms: NA
InterPro domains:
IPR004274 - FCP1 homology domain
IPR023214 - HAD superfamily
IPR036412 - HAD-like superfamily
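The genome location field packs the linkage group and the start and end coordinates into one string. The sketch below shows one way to split it and derive the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; `Locus` and `parse_locus` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, hence the +1 (assumption, see lead-in).
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a string of the form 'SEQID:START..END'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end regardless of input order.
        start, end = end, start
    return Locus(seqid, start, end)


locus = parse_locus("LG05:74902794..74906280")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the span of Tan0012532 on LG05 is 3487 bp.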


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAG7019988.1 hypothetical protein SDJN02_18956, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 4.6e-227 | % identity: 75.88
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MD+SECDT+E  EHKMKKRK EQFD+AP  +N  SVHSGSEDASSMD ILSEND A  A FI CSKLESET          K  +VHEK+ DDD+K+ + 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNEQE
        IATEH N+NGSGNLI            +SV+MEEPSS    + +  +SED GGMR+HDDH N ANV +ELS+E +DVRKD HSRE+FSD+G  FPCNEQE
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNEQE

Query:  YERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTPSQ
        +ERD SLKSSD++QING            G VEEA V  SVG+ DDETS+SKE II TP  +PPEL+NA+TVK+EVVCFS SGETSSGV+AI EE+TP+ 
Subjt:  YERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTPSQ

Query:  VLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQ
        VLDTS+KGDSI CSRKKLLVLDVNGLLADFICYVPYGYKPD+VI QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDM IDFLMGD R+KLLFCWDQ
Subjt:  VLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQ

Query:  SHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN
        SHCTDT FSTVENKHKPLVLK+I+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN
Subjt:  SHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN

Query:  PFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
        PFGQRPITEKN SWKFYRRIIYFVER+NDQ+D NS+ WN
Subjt:  PFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

XP_008451760.1 PREDICTED: uncharacterized protein LOC103492827 isoform X2 [Cucumis melo] | e value: 5.1e-242 | % identity: 79.52
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MDISECDT+E LEHKMKKRK EQFD+A E +NTGSV SG E ASS+D IL ENDP+PDAT I CSKLESETG+T PEI N + N VHE EH+DDQKL + 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDP-RISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCN
          TEHEN++GS NLILN A VKQNVA  SV+M+EP+S  AY+ED   +SED GG+RDH+  D GN  +V QELS+EMIDVRKD HSREK SD  Y  PCN
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDP-RISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCN

Query:  EQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERT
        EQEYE DGSLKS D++QI+   GNNASEKIVEG VEEA V CSVG+ DDE S+SKE I+ TPS +PP LENAET K+EVVCF+ASGETSSGVNA+ EE+ 
Subjt:  EQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERT

Query:  PSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFC
        PS VLDTSEKGDSIG +RKKLLVLDVNGLLADFICYVP GYKPDI+IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGD R KLLFC
Subjt:  PSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFC

Query:  WDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYV
        WDQSHCTDT FSTVENKHKPLVLKEI+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRV+LEGLSMAENVQKYV
Subjt:  WDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYV

Query:  EQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
        EQN FGQRPITEKN SWKFYRRIIYFVER+NDQ+++N ++WN
Subjt:  EQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

XP_022137426.1 uncharacterized protein LOC111008876 [Momordica charantia] | e value: 1.6e-248 | % identity: 80.77
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MDISE DTS DLEHK KKRK EQFD APE +N GSV SGSE+ASSMDNILSENDPAPDAT I CSKLESETGQ+HP I NP+ N VH KEH+DD +  K 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKD----DHSREKFSDAGYQFPC
        + TEHEN+NGS +LILNT DVKQNVAR+S++MEEPSS  AY+ED R+SED    RDHDDHGN A V QEL++EMID  KD     HS EK SD+ Y FPC
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKD----DHSREKFSDAGYQFPC

Query:  NEQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDP-DDETSSSKEWIIPTPSGMPPELENAETVK--QEVVCFSASGETSSGVNAIT
        NEQEYERD SLK+SDI+QINGACGNN SEK V+  +EEA VCC+VG+  DDETS+SKE II TPS MPPE ENA+TVK  +EVVCFSASGETS  ++AI 
Subjt:  NEQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDP-DDETSSSKEWIIPTPSGMPPELENAETVK--QEVVCFSASGETSSGVNAIT

Query:  EERTPSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRK
        EE  P  VLDTSEKGDSIG SRKKLLVLDVNGLLADFICYVPYGYKPDI+I QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDL++K
Subjt:  EERTPSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRK

Query:  LLFCWDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENV
        LLFCWDQSHCTDT FSTVEN HKPLVLKEI+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRD  DTSLGPGGDLRVYLEGLS+AENV
Subjt:  LLFCWDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENV

Query:  QKYVEQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
        QKYVEQNPFGQRPITEKNLSWKFYRRIIYFVERQND+DDTNS+KWN
Subjt:  QKYVEQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

XP_031738119.1 uncharacterized protein LOC101203219 isoform X2 [Cucumis sativus] | e value: 2.4e-239 | % identity: 78.74
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MDIS CDT+E LEHKMKKRK EQFD+A E +NTGSV SG E ASSMD IL E DP+PDAT + CSKLESETG+  PEI N K N VHEKEH+DD+KL K 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNE
          TE+EN+NGS NLILN A+VKQNVA +SV+MEEPSS  AY+ED  ISED GG+R H+  D GN   V QELS+EMIDV+KD HSREK SD  Y  PCNE
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNE

Query:  QEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTP
         EY+ DGSLKS D++QIN   GNNASEKIVEG VEEA VCCS G+ DDE S+ KE I+ TPS +PP LENAET K+EVVCF+ SGETSS VNA+ EE TP
Subjt:  QEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTP

Query:  SQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCW
          VLDTSEKGDSIG + KKLLVLDVNGLLADFICYVP GYKPDI+IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM D R KLLFCW
Subjt:  SQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCW

Query:  DQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVE
        DQSHCTDT FSTVENKHKPLVLKEI+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRV+LEGLSMAENVQKYVE
Subjt:  DQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVE

Query:  QNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
        QN FGQRPITEKN SWKFYRRIIYFVER+NDQ++ NS++WN
Subjt:  QNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

XP_038894827.1 uncharacterized protein LOC120083233 isoform X1 [Benincasa hispida] | e value: 1.1e-249 | % identity: 82
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MDISECDT E LEHKMKKRK EQFD+  E +N GSVHSGS+D  SMD ILSENDPA DA FI CSKLESETG+T PEI N K N VH KEH+DDQKL K 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNEQE
        + TEH+N+NG  NLILNT +VK+NVAR+SV++EEPSS  AY+ED  ISED GGM DHDDHGN  +V QELS+EMIDVRKDDHSREKFSD  Y  PCNE+E
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNEQE

Query:  YERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTPSQ
         E DGSLKSS+++QIN   GNNASEKIVEG VEE  VCCSV + DDETS+SKE I+ TP  +PPELENAET K+E VCFSASGETSSGV+AI EE+TPS 
Subjt:  YERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTPSQ

Query:  VLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQ
        VLDTSEKGDSIG SRKKLLVLDVNGLLADFI YVP GYKPDIVI QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGD R KLLFCWDQ
Subjt:  VLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQ

Query:  SHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN
        SHCTDT FSTVENKHKPLVLKEI+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRV+LEGLSMAENVQKYVEQN
Subjt:  SHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN

Query:  PFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
         FGQRPITEKN SWKFYRRIIYFVER+NDQ+DTNS+KWN
Subjt:  PFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0LSV7 FCP1 homology domain-containing protein | e value: 1.1e-239 | % identity: 78.74
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MDIS CDT+E LEHKMKKRK EQFD+A E +NTGSV SG E ASSMD IL E DP+PDAT + CSKLESETG+  PEI N K N VHEKEH+DD+KL K 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNE
          TE+EN+NGS NLILN A+VKQNVA +SV+MEEPSS  AY+ED  ISED GG+R H+  D GN   V QELS+EMIDV+KD HSREK SD  Y  PCNE
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNE

Query:  QEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTP
         EY+ DGSLKS D++QIN   GNNASEKIVEG VEEA VCCS G+ DDE S+ KE I+ TPS +PP LENAET K+EVVCF+ SGETSS VNA+ EE TP
Subjt:  QEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTP

Query:  SQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCW
          VLDTSEKGDSIG + KKLLVLDVNGLLADFICYVP GYKPDI+IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLM D R KLLFCW
Subjt:  SQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCW

Query:  DQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVE
        DQSHCTDT FSTVENKHKPLVLKEI+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRV+LEGLSMAENVQKYVE
Subjt:  DQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVE

Query:  QNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
        QN FGQRPITEKN SWKFYRRIIYFVER+NDQ++ NS++WN
Subjt:  QNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

A0A1S3BRN0 uncharacterized protein LOC103492827 isoform X2 | e value: 2.5e-242 | % identity: 79.52
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MDISECDT+E LEHKMKKRK EQFD+A E +NTGSV SG E ASS+D IL ENDP+PDAT I CSKLESETG+T PEI N + N VHE EH+DDQKL + 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDP-RISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCN
          TEHEN++GS NLILN A VKQNVA  SV+M+EP+S  AY+ED   +SED GG+RDH+  D GN  +V QELS+EMIDVRKD HSREK SD  Y  PCN
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDP-RISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCN

Query:  EQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERT
        EQEYE DGSLKS D++QI+   GNNASEKIVEG VEEA V CSVG+ DDE S+SKE I+ TPS +PP LENAET K+EVVCF+ASGETSSGVNA+ EE+ 
Subjt:  EQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERT

Query:  PSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFC
        PS VLDTSEKGDSIG +RKKLLVLDVNGLLADFICYVP GYKPDI+IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGD R KLLFC
Subjt:  PSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFC

Query:  WDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYV
        WDQSHCTDT FSTVENKHKPLVLKEI+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRV+LEGLSMAENVQKYV
Subjt:  WDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYV

Query:  EQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
        EQN FGQRPITEKN SWKFYRRIIYFVER+NDQ+++N ++WN
Subjt:  EQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

A0A5D3BIK1 Putative C-terminal domain small phosphatase isoform X2 | e value: 2.5e-242 | % identity: 79.52
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MDISECDT+E LEHKMKKRK EQFD+A E +NTGSV SG E ASS+D IL ENDP+PDAT I CSKLESETG+T PEI N + N VHE EH+DDQKL + 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDP-RISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCN
          TEHEN++GS NLILN A VKQNVA  SV+M+EP+S  AY+ED   +SED GG+RDH+  D GN  +V QELS+EMIDVRKD HSREK SD  Y  PCN
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDP-RISEDLGGMRDHD--DHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCN

Query:  EQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERT
        EQEYE DGSLKS D++QI+   GNNASEKIVEG VEEA V CSVG+ DDE S+SKE I+ TPS +PP LENAET K+EVVCF+ASGETSSGVNA+ EE+ 
Subjt:  EQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERT

Query:  PSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFC
        PS VLDTSEKGDSIG +RKKLLVLDVNGLLADFICYVP GYKPDI+IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGD R KLLFC
Subjt:  PSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFC

Query:  WDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYV
        WDQSHCTDT FSTVENKHKPLVLKEI+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRV+LEGLSMAENVQKYV
Subjt:  WDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYV

Query:  EQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
        EQN FGQRPITEKN SWKFYRRIIYFVER+NDQ+++N ++WN
Subjt:  EQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

A0A6J1C778 uncharacterized protein LOC111008876 | e value: 7.9e-249 | % identity: 80.77
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MDISE DTS DLEHK KKRK EQFD APE +N GSV SGSE+ASSMDNILSENDPAPDAT I CSKLESETGQ+HP I NP+ N VH KEH+DD +  K 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKD----DHSREKFSDAGYQFPC
        + TEHEN+NGS +LILNT DVKQNVAR+S++MEEPSS  AY+ED R+SED    RDHDDHGN A V QEL++EMID  KD     HS EK SD+ Y FPC
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKD----DHSREKFSDAGYQFPC

Query:  NEQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDP-DDETSSSKEWIIPTPSGMPPELENAETVK--QEVVCFSASGETSSGVNAIT
        NEQEYERD SLK+SDI+QINGACGNN SEK V+  +EEA VCC+VG+  DDETS+SKE II TPS MPPE ENA+TVK  +EVVCFSASGETS  ++AI 
Subjt:  NEQEYERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDP-DDETSSSKEWIIPTPSGMPPELENAETVK--QEVVCFSASGETSSGVNAIT

Query:  EERTPSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRK
        EE  P  VLDTSEKGDSIG SRKKLLVLDVNGLLADFICYVPYGYKPDI+I QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDL++K
Subjt:  EERTPSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRK

Query:  LLFCWDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENV
        LLFCWDQSHCTDT FSTVEN HKPLVLKEI+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRD  DTSLGPGGDLRVYLEGLS+AENV
Subjt:  LLFCWDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENV

Query:  QKYVEQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
        QKYVEQNPFGQRPITEKNLSWKFYRRIIYFVERQND+DDTNS+KWN
Subjt:  QKYVEQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

A0A6J1KFT0 uncharacterized protein LOC111495396 isoform X2 | e value: 1.6e-225 | % identity: 75.7
Query:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG
        MD+SECDT+E LEHKMKKRK EQFD+AP  +N  SVHSGSEDASSMD ILSEND A  A FI CSKLESET          K  +VHEK+ DDD+K+ + 
Subjt:  MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKG

Query:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNEQE
        +ATEH N+NGSGNLI            +SV+MEEPSS   Y+ +  ISED GGMR+HDDH N  NV +ELS+E ID RKD HSRE+FSD G  FPC EQE
Subjt:  IATEHENMNGSGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNEQE

Query:  YERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTPSQ
         ERD SLKSS ++QING            G VEEA V  SVG+ DDETS+SKE I  TP  +PPEL+NAETVK+EVVCFS SGETSSG++AI EE+TP+ 
Subjt:  YERDGSLKSSDIDQINGACGNNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTPSQ

Query:  VLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQ
        VLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPD+VI QKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDM IDFLMGD R+KLLFCWDQ
Subjt:  VLDTSEKGDSIGCSRKKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQ

Query:  SHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN
        S CTDT FSTVENKHKPLVLK+I+KLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVE+N
Subjt:  SHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN

Query:  PFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
        PFGQRPITEKN SWKFYRRIIYFVER+NDQ+DTNS+ WN
Subjt:  PFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN

SwissProt top hits (columns: hit, e value, % identity, alignment)
O94336 Uncharacterized FCP1 homology domain-containing protein C1271.03c | e value: 5.7e-10 | % identity: 26.24
Query:  KKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFK-------RPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRK-LLFCWDQSHCTDTK
        +KL++LD+NG L   +C      +   V  +K+V++       RP   +F+K+ F  F V V+SS    NV  ++  +M + ++K L+ CW +    D K
Subjt:  KKLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFK-------RPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRK-LLFCWDQSHCTDTK

Query:  FSTVENKHKPLVLKEIRKLWKYL------KPREFNASNTLLLDDSPHKALCNPAN-TAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN
         +  +   K    K +  +W+ +      KP  ++  NT+++DDS  K   +P N  A+     +        +     +R YL+ L    NV  Y+ + 
Subjt:  FSTVENKHKPLVLKEIRKLWKYL------KPREFNASNTLLLDDSPHKALCNPAN-TAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN

Query:  PF
        PF
Subjt:  PF

Q8T3G2 CTD small phosphatase-like protein 1 | e value: 3.2e-05 | % identity: 28.65
Query:  SRKKLLVLDVNGLLA----------DFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQSHC
        S KK LV+D++  L           DF+  V    + D V  Q  V KRP+ D+F+    E FE  ++++   +  D V D L  D +R       +  C
Subjt:  SRKKLLVLDVNGLLA----------DFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQSHC

Query:  TDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSL
                   HK   +K++ +L +       N + TL++D+SP     +P N    PVT  F D  DT L
Subjt:  TDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSL

Q9XYL0 Probable C-terminal domain small phosphatase | e value: 1.3e-09 | % identity: 29.17
Query:  KLLVLDVNGLLADFICYVPYGYKPDIV--------IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQSHCTDTKF
        K LVLD++  L     + P  + PD +        I Q  V KRPF DDF++   E+FE+ V+++   +  D V+DFL  D  R + +   +  C     
Subjt:  KLLVLDVNGLLADFICYVPYGYKPDIV--------IRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQSHCTDTKF

Query:  STVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN
            + HK   +K++ +L + LK       +T+++D+SP   L +P N    P+   F D DD  L    DL   L+ L   E+V+  ++++
Subjt:  STVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQN

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT2G36540.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | e value: 3.5e-39 | % identity: 35.16
Query:  ETVKQEVVCFSASGETSSGVNAITEERTPSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFI-----CYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFE
        E +K+ ++    S +  S  + ++++   S +LD           +KKLLVL ++GLL   +        P    PD       V+KRPF ++F+KFC E
Subjt:  ETVKQEVVCFSASGETSSGVNAITEERTPSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADFI-----CYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFE

Query:  RFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTY
        RFEVG+WSS            L+  L   +L       CTD+ + T+EN++KPL  K++ K++K  K   F+ASNT+ +DD P+KAL NP NT +FP++Y
Subjt:  RFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTY

Query:  RFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQNPFGQRPITEKNLSWKFYRRI
           +  D  L P G+L  YLEGL+ + +VQ Y++ + FG+  I   +  W FY  +
Subjt:  RFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQNPFGQRPITEKNLSWKFYRRI

AT2G36550.1 CONTAINS InterPro DOMAIN/s: NLI interacting factor (InterPro:IPR004274) | e value: 1.4e-27 | % identity: 43.44
Query:  DQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVE
        DQ  CTD+ + T+EN  KPL  K++ K+++  K   F+ASNT+ +++ P+KAL NP NT +FP++Y   DT D  L P G+   YL+GL+ + +VQ Y++
Subjt:  DQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVE

Query:  QNPFGQRPITEKNLSWKFYRRI
        ++PFGQ  I   +L W +YRR+
Subjt:  QNPFGQRPITEKNLSWKFYRRI

AT3G29760.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | e value: 5.2e-43 | % identity: 51.61
Query:  RKKLLVLDVNGLLADFICYVPYGYKP-DIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQSHCTDTKFSTVEN
        RKKLLVLD+NGLLAD +   P    P DI I ++A+FKRPFCD+F++FCF++FEVG+WSSR + NV  + +FL+GDL+ KLLFCWD S+C  T   ++EN
Subjt:  RKKLLVLDVNGLLADFICYVPYGYKP-DIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQSHCTDTKFSTVEN

Query:  KHKPLVLKEIRKLWKYLKPR------EFNASNTLLLDDSPHKALCNPANTAIFPV
        ++K +V K++ +LW+   PR      ++N +NT+LLDDSP+KAL NP  + I  +
Subjt:  KHKPLVLKEIRKLWKYLKPR------EFNASNTLLLDDSPHKALCNPANTAIFPV

AT4G26190.1 Haloacid dehalogenase-like hydrolase (HAD) superfamily protein | e value: 1.8e-59 | % identity: 45.38
Query:  ETSSGVNAITEERTPSQVLDTSEKGDSIGCSRK-----KLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRN
        ETS   + + +    +Q + +SE GD   C+ K     KL++ D+NG+LAD +      + PD  +  ++VF+RPF   F+ FCFERF+V +WSSR R  
Subjt:  ETSSGVNAITEERTPSQVLDTSEKGDSIGCSRK-----KLLVLDVNGLLADFICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRN

Query:  VDMVIDFLMGDLRRKLLFCWDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYL------KPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDT
        +D +I+ +M +  R LLFC+DQ+ CT TKF T E K KPL LK++R++W ++        R+++ +NTLL+DDSP KALCNP +T IFP  Y++ +  D+
Subjt:  VDMVIDFLMGDLRRKLLFCWDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYL------KPREFNASNTLLLDDSPHKALCNPANTAIFPVTYRFRDTDDT

Query:  SLGPGGDLRVYLEGLSMAENVQKYVEQNPFGQRPITEKNLSWKFYRRII
        +LGP G+LR YLE L+ AENVQK+V +NPFGQ  ITE + SW+FY + +
Subjt:  SLGPGGDLRVYLEGLSMAENVQKYVEQNPFGQRPITEKNLSWKFYRRII
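In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below counts identities and positives for one alignment segment under that convention. `midline_stats` is a hypothetical helper, and the example strings are the last segment of the first GenBank hit. The % identity values listed for each hit cover the whole alignment, so a single segment will generally give a different figure.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment segment.

    The aligned length is taken from the query line because trailing blanks
    of the midline are often lost when a page is copied as text.
    """
    length = len(query)
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, length


query = "PFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN"
midline = "PFGQRPITEKN SWKFYRRIIYFVER+NDQ+D NS+ WN"

identities, positives, length = midline_stats(query, midline)
print(f"identities: {identities}/{length} ({100 * identities / length:.2f}%)")
print(f"positives:  {positives}/{length} ({100 * positives / length:.2f}%)")
```

To approximate a per-hit value, the identity counts and lengths of all segments of that hit would be summed before dividing.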


Sequences
CDS sequence
ATGGATATTTCAGAATGTGATACAAGTGAAGATTTGGAGCATAAAATGAAAAAGAGAAAGCTAGAACAGTTTGATAATGCTCCTGAAGATAGCAATACGGGCAGTGTTCA
TTCTGGTTCTGAAGATGCGTCTTCAATGGATAATATCCTGTCGGAAAATGATCCAGCTCCTGATGCCACGTTCATTAAATGCTCAAAGCTCGAATCTGAAACAGGACAAA
CTCATCCAGAGATATATAATCCAAAGGCAAATGATGTTCATGAAAAGGAACATGATGATGACCAAAAATTGTTAAAGGGCATTGCTACAGAGCATGAAAATATGAATGGT
TCTGGTAATCTTATCCTGAATACAGCAGATGTCAAACAAAATGTTGCTAGATTTAGTGTCAAAATGGAAGAGCCAAGTTCCACGAAAGCTTACGAAGAAGACCCTAGGAT
CTCTGAAGATCTTGGTGGTATGAGAGATCATGATGATCATGGAAACTTTGCCAATGTTGGCCAAGAACTGAGCCAGGAGATGATAGATGTGAGGAAGGATGATCATTCTA
GAGAAAAATTCTCTGACGCTGGTTATCAATTTCCATGTAATGAGCAGGAATATGAGAGGGATGGTTCATTGAAAAGTTCAGATATAGATCAGATAAATGGTGCATGTGGT
AATAATGCCTCGGAGAAGATTGTGGAAGGTGATGTGGAGGAAGCTCTTGTTTGCTGTTCTGTTGGTGATCCTGATGATGAAACGTCATCAAGCAAGGAATGGATTATACC
AACTCCCTCCGGCATGCCTCCTGAACTGGAAAATGCTGAAACTGTGAAGCAAGAAGTTGTATGTTTCTCAGCTTCTGGTGAGACAAGCAGCGGTGTTAATGCTATCACTG
AAGAGAGAACTCCATCGCAGGTATTGGATACATCAGAGAAAGGAGATTCTATTGGCTGTTCAAGGAAAAAGCTTCTTGTTCTCGATGTAAATGGATTGCTTGCAGATTTT
ATTTGTTACGTTCCTTATGGATATAAGCCAGACATTGTAATAAGACAAAAAGCAGTATTCAAGAGGCCATTTTGTGATGATTTTATAAAGTTTTGTTTTGAAAGATTCGA
GGTGGGTGTTTGGTCGTCAAGAACTCGGAGAAATGTGGACATGGTGATAGATTTTCTAATGGGAGATTTGAGGCGAAAATTACTATTTTGCTGGGATCAATCACATTGTA
CCGATACCAAGTTCTCTACGGTTGAGAATAAGCACAAGCCTTTAGTCTTAAAGGAAATTAGAAAACTGTGGAAATACCTTAAGCCACGAGAGTTTAATGCATCAAACACT
CTATTGTTGGATGATTCCCCACACAAGGCATTGTGCAATCCGGCAAACACTGCAATATTTCCTGTAACATATCGGTTTAGGGATACCGACGATACGTCGTTAGGACCGGG
AGGCGATCTTCGGGTTTACTTGGAAGGTTTATCAATGGCAGAAAATGTTCAAAAATATGTTGAGCAGAATCCGTTTGGTCAACGTCCTATTACAGAAAAGAACCTGTCTT
GGAAGTTTTATCGACGGATCATATATTTTGTCGAGCGCCAAAACGATCAGGATGATACCAATTCTTACAAATGGAACTGA
mRNA sequence
ATGGATATTTCAGAATGTGATACAAGTGAAGATTTGGAGCATAAAATGAAAAAGAGAAAGCTAGAACAGTTTGATAATGCTCCTGAAGATAGCAATACGGGCAGTGTTCA
TTCTGGTTCTGAAGATGCGTCTTCAATGGATAATATCCTGTCGGAAAATGATCCAGCTCCTGATGCCACGTTCATTAAATGCTCAAAGCTCGAATCTGAAACAGGACAAA
CTCATCCAGAGATATATAATCCAAAGGCAAATGATGTTCATGAAAAGGAACATGATGATGACCAAAAATTGTTAAAGGGCATTGCTACAGAGCATGAAAATATGAATGGT
TCTGGTAATCTTATCCTGAATACAGCAGATGTCAAACAAAATGTTGCTAGATTTAGTGTCAAAATGGAAGAGCCAAGTTCCACGAAAGCTTACGAAGAAGACCCTAGGAT
CTCTGAAGATCTTGGTGGTATGAGAGATCATGATGATCATGGAAACTTTGCCAATGTTGGCCAAGAACTGAGCCAGGAGATGATAGATGTGAGGAAGGATGATCATTCTA
GAGAAAAATTCTCTGACGCTGGTTATCAATTTCCATGTAATGAGCAGGAATATGAGAGGGATGGTTCATTGAAAAGTTCAGATATAGATCAGATAAATGGTGCATGTGGT
AATAATGCCTCGGAGAAGATTGTGGAAGGTGATGTGGAGGAAGCTCTTGTTTGCTGTTCTGTTGGTGATCCTGATGATGAAACGTCATCAAGCAAGGAATGGATTATACC
AACTCCCTCCGGCATGCCTCCTGAACTGGAAAATGCTGAAACTGTGAAGCAAGAAGTTGTATGTTTCTCAGCTTCTGGTGAGACAAGCAGCGGTGTTAATGCTATCACTG
AAGAGAGAACTCCATCGCAGGTATTGGATACATCAGAGAAAGGAGATTCTATTGGCTGTTCAAGGAAAAAGCTTCTTGTTCTCGATGTAAATGGATTGCTTGCAGATTTT
ATTTGTTACGTTCCTTATGGATATAAGCCAGACATTGTAATAAGACAAAAAGCAGTATTCAAGAGGCCATTTTGTGATGATTTTATAAAGTTTTGTTTTGAAAGATTCGA
GGTGGGTGTTTGGTCGTCAAGAACTCGGAGAAATGTGGACATGGTGATAGATTTTCTAATGGGAGATTTGAGGCGAAAATTACTATTTTGCTGGGATCAATCACATTGTA
CCGATACCAAGTTCTCTACGGTTGAGAATAAGCACAAGCCTTTAGTCTTAAAGGAAATTAGAAAACTGTGGAAATACCTTAAGCCACGAGAGTTTAATGCATCAAACACT
CTATTGTTGGATGATTCCCCACACAAGGCATTGTGCAATCCGGCAAACACTGCAATATTTCCTGTAACATATCGGTTTAGGGATACCGACGATACGTCGTTAGGACCGGG
AGGCGATCTTCGGGTTTACTTGGAAGGTTTATCAATGGCAGAAAATGTTCAAAAATATGTTGAGCAGAATCCGTTTGGTCAACGTCCTATTACAGAAAAGAACCTGTCTT
GGAAGTTTTATCGACGGATCATATATTTTGTCGAGCGCCAAAACGATCAGGATGATACCAATTCTTACAAATGGAACTGA
Protein sequence
MDISECDTSEDLEHKMKKRKLEQFDNAPEDSNTGSVHSGSEDASSMDNILSENDPAPDATFIKCSKLESETGQTHPEIYNPKANDVHEKEHDDDQKLLKGIATEHENMNG
SGNLILNTADVKQNVARFSVKMEEPSSTKAYEEDPRISEDLGGMRDHDDHGNFANVGQELSQEMIDVRKDDHSREKFSDAGYQFPCNEQEYERDGSLKSSDIDQINGACG
NNASEKIVEGDVEEALVCCSVGDPDDETSSSKEWIIPTPSGMPPELENAETVKQEVVCFSASGETSSGVNAITEERTPSQVLDTSEKGDSIGCSRKKLLVLDVNGLLADF
ICYVPYGYKPDIVIRQKAVFKRPFCDDFIKFCFERFEVGVWSSRTRRNVDMVIDFLMGDLRRKLLFCWDQSHCTDTKFSTVENKHKPLVLKEIRKLWKYLKPREFNASNT
LLLDDSPHKALCNPANTAIFPVTYRFRDTDDTSLGPGGDLRVYLEGLSMAENVQKYVEQNPFGQRPITEKNLSWKFYRRIIYFVERQNDQDDTNSYKWN
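The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, not a CuGenDB tool. It is demonstrated on the first and last few codons of the CDS shown above rather than on the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and blanks
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGATATTTCAGAATGTGATACA"))  # first 8 codons -> MDISECDT
print(translate("TACAAATGGAACTGA"))           # last 5 codons  -> YKWN
```

Both results agree with the start (MDISECDT) and end (YKWN) of the protein sequence listed above. Pasting the full CDS into `translate` should reproduce the whole protein if the standard-code assumption holds.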